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These lecture notes are based on 3 courses in non-relativistic quantum mechanics that are given at BGU: "Quan- 
tum 2" (undergraduates), "Quantum 3" (graduates), and "Advanced topics in Quantum and Statistical Mechanics" 
(graduates). The lecture notes are self contained, and give the road map to quantum mechanics. However, they do 
not intend to come instead of the standard textbooks. In particular I recommend: 

[1] L.E.Ballentinc, Quantum Mechanics (hbrary code: QC 174.12.B35). 

[2] J.J. Sakurai, Modern Quantum mechanics (library code: QC 174.12.S25). 

[3] Feynman Lectures Volume III. 

[4] A. Messiah, Quantum Mechanics, [for the graduates] 

The major attempt in this set of lectures was to give a self contained presentation of quantum mechanics, which is not 
based on the historical "quantization" approach. The main inspiration comes from Ref.[3] and Ref.[l]. The challenge 
was to find a compromise between the over- heuristic approach of Ref . [3] and the too formal approach of Ref . [1] . 

Another challenge was to give a presentation of scattering theory that goes well beyond the common undergraduate 
level, but still not as intimidating as in Ref. [4]. A major issue was to avoid the over emphasis on spherical geometry. 
The language that I use is much more suitable for research with "mesoscopic" orientation. 

Some highlights for those who look for original or advanced pedagogical pieces: The EPR paradox. Bell's inequality, 
and the notion of quantum state; The 4 postulates of quantum mechanics; Berry phase and adiabatic processes; Linear 
response theory and the Kubo formula; Wigner-Weyl formalism; Quantum measurements; Quantum computation; 
The foundations of Statistical mechanics. Note also the following example problems: Analysis of systems with 2 
or 3 or more sites; Analysis of the Landau-Zener transition; The Bose-Hubbard Hamiltonian; Quasi ID networks; 
Aharonov-Bohm rings; Various problems in scattering theory. 



Credits 

The first drafts of these lecture notes were prepared and submitted by students on a weekly basis during 2005. 
Undergraduate students were requested to use HTML with ITEX formulas. Typically the text was written in Hebrew. 
Graduates were requested to use Latex. The drafts were corrected, integrated, and in many cases completely re- written 
by the lecturer. The English translation of the undergraduate sections has been prepared by my former student Gilad 
Rosenberg. He has also prepared most of the illustrations. The current version includes further contributions by my 
PhD students Maya Chuchem and Itamar Sela. I also thank my colleague Prof. Yehuda Band for some comments on 
the text. 

The arXiv versions are quite remote from the original (submitted) drafts, but still I find it appropriate to list the 
names of the students who have participated: Natalia Antin, Roy Azulai, Dotan Babai, Shlomi Batsri, Ynon Ben- 
Haim, Avi Ben Simon, Asaf Bibi, Lior Blockstein, Lior Boker, Shay Cohen, Liora Damari, Anat Daniel, Ziv Danon, 
Barukh Dolgin, Anat Dolman, Lior Eligal, Yoav Etzioni, Zeev Freidin, Eyal Gal, Ilya Gurwich, David Hirshfeld, 
Daniel Horowitz, Eyal Hush, Liran Israel, Avi Lamzy, Roi Levi, Danny Levy, Asaf Kidron, liana Kogen, Roy Liraz, 
Arik Maman, Rottem Manor, Nitzan Mayorkas, Vadim Milavsky, Igor Mishkin, Dudi Morbachik, Ariel Naos, Yonatan 
Natan, Idan Oren, David Papish, Smadar Reick Goldschmidt, Alex Rozenberg, Chen Sarig, Adi Shay, Dan Shenkar, 
Idan Shilon, Asaf Shimoni, Raya Shindmas, Ramy Shnciderman, Elad Shtilcrman, Eli S. Shutorov, Ziv Sobol, Jenny 
Sokolcvsky, Alon Soloshenski, Tomer Tal, Oren Tal, Amir Tzvieli, Dima Vingurt, Tal Yard, Uzi Zecharia, Dany 
Zemsky, Stanislav Zlatopolsky. 



Warning 

This is the third version. Still it may contain typos. 
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[1] Introduction 



== [1-1] The Building Blocks of the Universe 

The world we live in consists of a variety of particles which are described by the " standard model" . The known 
particles are divided into two groups: 

• Quarks: constituents of the proton and the neutron, which form the ^ 100 nuclei known to us. 

• Leptons: include the electrons, muons, taus, and the neutrinos. 

In addition, the interaction between the particles is by way of fields (direct interaction between particles is contrary 
to the principles of the special theory of relativity). These interactions are responsible for the way material is 
" organized" . The gravity field has yet to be incorporated into quantum theory. We will be concerned mostly with 
the electromagnetic interaction. The electromagnetic field is described by the Maxwell equations. 

^^^^= [1.2] What Happens to a Particle in an Electromagnetic Field? 



Within the framework of classical electromagnetism, the electromagnetic field is described by the scalar potential 
V{x) and the vector potential A{x). In addition one defines: 

B = WxA (1) 

^ . -If -vv 

c ot 

We will not be working with natural units in this course, but from now on we are going to absorb the constants c and 
e in the definition of the scalar and vector potentials: 

-A A, eV V (2) 

c 

-B ^ B, e£ ^ £ 

c 

In classical mechanics, the effect of the electromagnetic field is described by Newton's second law with the Lorentz 
force. Using the above units convention we write: 

i^-{£-Bxv) (3) 
m 

The Lorentz force dependents on the velocity of the particle. This seems arbitrary and counter intuitive, but we shall 
see in the future how it can be derived from general and fairly simple considerations. 

In analytical mechanics it is customary to derive the above equation from a Lagrangian. Alternatively, one can use a 
Legendre transform and derive the equations of motion from a Hamiltonian: 

. _ dH 

^ dx 
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Where the Hamiltonian is: 

■H{x,p)^^{p-A{x)f + V{x) (5) 



[1.3] Canonical Quantization 



The historical method of deriving the quantum description of a system is canonical quantization. In this method we 
assume that the particle is described by a "wave function" that fulfills the equation: 

This seems arbitrary and counter-intuitive. In this course we will not use this historical approach. Rather, we will 
construct quantum mechanics in a natural way using only simple considerations. Later we will see that classical 
mechanics can be obtained as a special limit of the quantum theory. 



== [1-4] Second Quantization 

The method for quantizing the electromagnetic field is to write the Hamiltonian as a sum of harmonic oscillators 
(normal modes) and then to quantize the oscillators. It is exactly the same as finding the normal modes of spheres 
connected with springs. Every normal mode has a characteristic frequency. The ground state of the field (all the 
oscillators are in the ground state) is called the "vacuum state". If a specific oscillator is excited to level n, we say 
that there are n photons with frequency lo in the system. 

A similar formalism is used to describe a many particle system. A vacuum state and occupation states are defined. 
This formalism is called "second quantization". A better name would be "formalism of quantum field theory". 

In the first part of this course we will not talk about " second quantization" : The electromagnetic field will be described 
in a classic way using the potentials V{x)^ Aix)^ while the distinction between fermions and bosons will be done using 
the (somewhat unnatural) language of " first quantization" . 



== [1-5] Definition of Mass 

The "gravitational mass" is defined using a scale. Since gravitational theory is not includes in this course, we will not 
use that definition. Another possibility is to define " inertia! mass" . This type of mass is determined by considering 
the collision of two bodies: 

mwi + m2V2 = miMi + m2U2 (7) 

So: 

mi U2 - V2 

— — \°) 

m2 Vl — ui 

In order to be able to measure the inertial mass of an object, we must do so in relation to a reference mass. In other 
words: we use an arbitrary object as our basic mass unit. 

Within the framework of quantum mechanics the above Newtonian definition of inertial mass will not be used. Rather 
we define mass in an absolute way. We shall define mass as a parameter in the " dispersion relation" , and we shall see 
that the units of mass are: 



(9) 
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If we choose to set the units of mass in an arbitrary way to be kg, then a units conversion scheme will be necessary. 
The conversion scheme is simply a multiplication by the Planck constant: 

m[kg] = hm (10) 



[1.6] The Dispersion Relation 



It is possible to prepare a "monochromatic" beam of (for example) electrons that all have the same velocity, and 
the same De-Brogiie wavelength. The velocity of the particles can be measured by using a pair of rotating circular 
plates (discs). The wavelength of the beam can be measured by using a diffraction grating. We define the particle's 
momentum ("wave number") as: 

p ~ 27r/ wavelength (11) 

It is possible to find (say by an experiment) the relation between the velocity of the particle and its momentum. This 
relation is called the "dispersion relation". For low velocities (not relativistic) the relation is approximately linear: 

v = —p (12) 
m 

This relation defines the "mass" parameter and also the units of mass. 



[1.7] Spin 



Apart from the degrees of freedom of being in space, the particles also have an inner degree of freedom called "spin" 
(Otto Stern and Walter Gerlach 1922). We say that a particle has spin s if its inner degree of freedom is described by a 
representation of the rotations group of dimension 2s + 1. For example, "spin i" can be described by a representation 
of dimension 2, and "spin 1" can be described by a representation of dimension 3. In order to make this abstract 
statement clearer we will look at several examples. 

• Electrons have spin ^, so 180° difference in polarization ("up" and "down") means orthogonality. 

• Photons have spin 1, so 90° difference in linear polarizations means orthogonality. 

If we position two polarizers one after the other in the angles that were noted above, no particles will pass through. 
We see that an abstract mathematical consideration (representations of the rotational group) has very "realistic" 
consequences. 



[2] Digression: The classical description of nature 



= [2.1] The Classical Effect of the Electromagnetic Field 

The electric field S and the magnetic field B can be derived from the vector potential A and the electric potential V: 

£.-V,'-if (13, 
B = V xA 

The electric potential and the vector potential arc not uniquely determined, since the electric and the magnetic fields 
are not affected by the following changes: 

V ^ V = V --^ (14) 
c ot 

A A = A + Wk 

Where A(x,t) is an arbitrary scalar function. Such a transformation of the potentials is called "gauge". A special 
case of "gauge" is changing the potential V by an addition of a constant. 

Gauge transformations do not affect the classical motion of the particle since the equations of motion contain only 
the derived fields £,B. 

— r = — e£ Bxx (15) 

This equation of motion can be derived from the Langrangian: 

1 e 

C(x, x) = -mx^ + -xA{x, t) - eV{x, t) (16) 

2 c 

Or, alternatively, from the Hamiltonian: 

■H{x,p)^^{p--Af +eV (17) 
2m c 



=== [2.2] Lorentz Transformation 

The Lorentz transformation takes us from one reference frame to the other. A Lorentz boost can be written in matrix 
form as: 



/ 


7 


-7/3 




-7/3 


7 







1 


V 





1 



(18) 



Where j3 is the velocity of our reference frame relative to the reference frame of the lab, and 



(19) 



We use units such that the speed of hght is c = 1. The position of the particle in space 



X 



and we write the transformations as: 



x' = S-x 



We shah see that it is convenient to write the electromagnetic field as: 



£i £2 £3 \ 

£1 63 -62 

£2 -63 Bi 

,£3 B2 -Bi / 



We shall argue that this transforms as: 



F' ^ SFS~^ 



or in terms of components: 



£' ~ £\ B' = B\ 

4 = 7(^^2-/3^3) bI = ^{B2 + I3£3) 

£'^^^{£3 + l3B2) B'^^^{B3-I3£2) 



[2.3] Momentum and energy of a particle 



Let us write the displacement of the particle as: 



dx = 



'dt\ 

dx 

dy 
\dzj 



We also define the proper time (as measured in the particle frame) as: 



dr^ — dt^ — dx^ — dy^ — dz^ = (1 — — Vy^ — Vz'^)dt^ 



dr = \Jl - v'^dt 



The relativistic velocity vector is: 
dx 



U : 



dr 
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and obviously: 

-ul^ul-ul^l (28) 
We also use the notation: 

p..u. (29) 

According to the above equations we have: 

E^~pl-pl-pl = rr,^ (30) 

and we can write the dispersion relation: 

E = +p2 (31) 

P 

We note that for non-relativistic velocities pi w mvi for i = 1, 2, 3 while: 

^ (it m 1 9 , , 

E = m ^ , =r^m + -m7.2+ (32) 



== [2.4] Equations of Motion for a Particle 

The non-relativistic equations of motion for a particle in an electromagnetic field are: 

^=rr,^ = e£~eBxv (33) 
dt dt ^ ' 

The rate of change of the particle's energy E is: 

— = f.v^e£-v (34) 

The electromagnetic field has equations of motion of its own: Maxwell's equations. As we shall see shortly Maxwell's 
equations are Lorentz invariant. But Newton's laws as written above are not. In order for the Newtonian equations 
of motion to be Lorentz invariant we have to adjust them. It is not difficult to see that the obvious way is: 

— = m— = eFu 35) 
dt dt ^ ^ 

To prove the invariance under the Lorentz transformation we write: 

^ = 4(5P) = ^-T-P = S{eFu) = eSFS~\Su) = eF'u' (36) 
dr ar dr 



Hence we have deduced the transformation F' = SFS ^ of the electromagnetic field. 
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[2.5] Equations of Motion of the Field 



Back to Maxwell's equations. A simple way of writing them is 



(37) 



Where the derivative operator d, and the four-current J, are defined as: 



d = 



\ Ox/ 



at ^(1, A, A, A) 

i9i' dx^ dy^ dz 



(38) 



and: 



J y 



— (P: Jxi Jyj Jz) 



(39) 



The Maxwell equations are invariant because J and d transform as vectors. For more details see Jackson. An 
important note about notations: in this section we have used what is called a " contravariant" representation for the 
column vectors. For example u = column(wf , Ua;, Uj,, Uz). For the "adjoint" we use the "covariant" representation 
u = row(uj, — Uj;, — Wy, —Uz)- Note that v)u = [utY' — (u^.)^ — i'^y)'^ ~ (""z)^ is ^ Lorentz scalar. 



[3] Hilbert space 



== [3.1] Linear Algebra 

In Euclidean geometry, three dimensional vectors can be written as: 

U = Uiei + 1*262 + 1*363 

Using Dirac notation wc can write the same as: 

\u) = ui\ei) + U2\e2) + usles) 
Wc say that the vector has the representation: 




The operation of a linear operator A is written as \v) = A\u) which is represented by: 
vi\ I All A12 AiA fui\ 

V2 \ = \ A21 A22 A23 
V3J \Ai3 A23 A33J XusJ 

or shortly as Vi = AijUj. 

Thus the linear operator is represented by a matrix: 







(All 


A12 


A 


A h- 


^ A^J = 1 


A21 


A22 


A^ 




Ul3 


A23 


A 



=^= [3.2] Orthonormal Basis 

We assume that an inner product {u\v) has been defined. From now on we assume that the basis has been 
be orthonormal: 

{ei\ej) = Si-i 

In such a basis the inner product (by linearity) can be calculated as follows: 

{u\v) ~ U*iVl + U2V2 + "5^3 

It can also be easily proved that the elements of the representation vector can be calculated as follows: 

Uj = {ej\u) 
And for the matrix elements we can prove: 

Aij = (cil^lej) 
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[3.3] Completeness of the Basis 



In Dirac notation the expansion of a vector is written as: 

\u) = |ei)(ei|M) + \e2)(e2\u) + \e3){e3\u) (49) 
which impUes 

1 = |ei)(ei| + |e2)(e2| + |e3)(e3| (50) 
Above 1 stands for the identity operator: 
/I 0\ 

1 ^ -^'^^ = 1 (^1) 
\0 1/ 

Now we can define the "completeness of the basis" as \ej){ej\ = 1 where = \ej){ej\ are cahed "projector 
operators" . Projector operators have eigenvahies 1 and 0. For example: 

/I 0\ 

(52) 
\o 0/ 



== [3.4] Operators 

Definition: an adjoint operator is an operator which satisfies the following relation: 

{u\Av) = (53) 

If we substitute the basis vectors in the above relation we get the equivalent matrix-style definition {A^)ij = A*^. In 
what follows we are interested in "normal" operators that are diagonal in some orthonormal basis. [Hence they satisfy 
A'^ A = AA^]. Of particular importance are Hermitian operators [A^ = A] and unitary operators [A^A =1]. It follows 
from the discussion below that any normal operator can be written as a function f{H) of an Hermitian operator H. 

Say that we have a normal operator A. This means that there is a basis {|a)} such that A is diagonal. This means 
that: 

A = ^|a)a(aH^aP'^ (54) 

a a 

In matrix representation this can be written as: 

^ai 0\ /l 0\ /O 0\ /O 0^ 

02 =ai 000 +a2 010 +a3 000| (55) 



03/ VO 0/ VO 0/ \0 1 



Thus we see that any normal operator is a combination of projectors. 



It is useful to define what is meant by _B = /(^) where /() is an arbitrary function. Assuming that A = |a)a(a|, it 
follows by definition that B = J2 k)/('^)('^l- Another useful rule to remember is that if A\k) = B\k) for some complete 
basis k, then it follows by linearity that = B\ip) for any vector, and therefore A = B. 
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Hermitian operators are of particular importance. By definition they satisfy = H and hence their eigenvalues 
are real numbers (A* = A^). Another class of important operators are unitary operators. By definition they satisfy 
= U^^ or equivalently U^U = 1. Hence their eigenvalues satisfy A*Ar = 1, which means that they can be written 

as: 



U = 



El 



(56) 



where H is Hermitian. In fact it is easy to see that any normal operator can be written as a function of some H . We 
can regard any H with non-degenerate spectrum as providing a specification of a basis, and hence any other operator 
that is diagonal in that basis can be expressed as a function of that H. An operator which is not "normal" can be 
expressed as Q = A + iB where A and B arc non-commuting Hermitian operators. Operator in "normal" iff A and 
B commute. 



^^^^= [3.5] Change of basis 

Definition of T: 

Assume we have an "old" basis and a "new" basis for a given vector space. In Dirac notation: 

old basis = { |a = 1), |a = 2), |a = 3), . . . } (57) 
new basis = { \a = I), \a ~ 2), |a = 3), . . . } 

The matrix Ta^a whose columns represent the vectors of the new basis in the old basis is called the "transformation 
matrix from the old basis to the new basis" . In Dirac notation this may be written as: 

a 

In general, the bases do not have to be orthonormal. However, if they are orthonormal then T must be unitary and 
we have 

Ta,a = {a\a) (59) 

In this section wc will discuss the general case, not assuming orthonormal basis, but in the future we will always work 
with orthonormal bases. 

Definition of S: 

If we have a vector-state then we can represent it in the old basis or in the new basis: 

IV;) = 5]^a|a) (60) 

a 

Q 

So, the change of representation can be written as: 

V'a = ^Sa,a1pa (61) 

a 

Or, written abstractly: 



■0 = Sip 



(62) 
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The transformation matrix from the old representation to the new representation is: S = T ^. 
Similarity Transformation: 

A unitary operation can be represented in cither the new basis or the old basis: 

Va = ^^0,6-06 (63) 

a 

If a = ^Aa.fl-ijjf} 

a 

The implied transformation between the representations is: 

A = SAS-^ = T-^AT (64) 
This is called a similarity transformation. 

=== [3.6] Generalized spectral decompositions 



Not any operator is normal: that means that not any matrix can be diagonalized by a unitary transformation. In 
particular we have sometime to deal with non-Hermitian Hamiltonian that appear in the reduced description of open 
systems. For this reason and others it is important to know how the spectral decomposition can be generalized. The 
generalization has a price: either we have to work with non-orthonormal basis or else we have to work with two 
unrelated orthonormal sets. The latter procedure is known as singular value decomposition (SVD). 

Given a matrix A we can find its eigenvalues Ar, which wc assume below to be non degenerate, without making any 
other assumption we can always define a set |r) of right eigenstates that satisfy A\r) = Xr\r). We can also define a 
set |f) of left eigenstates that satisfy A^f) = A*|f). Unless A is normal, the r basis is not orthogonal, and therefore 
(r|yl|s) is not diagonal. But by considering (r|A|s) we can prove that (f|s) = if ?' 7^ s. Hence we have dual basis 
sets, and without loss of generality we adopt a normalization convention such that 

{f\s)=Sr,, (65) 
so as to have the generalized spectral decomposition: 

A ^ = T [diag{AJ] T^i (66) 

r 

where T is the transformation matrix whose columns arc the right eigenvectors, while the rows of T^^ are the 
left eigenvectors. In the standard decomposition method A is regarded as describing stretching/squeezing in some 
principal directions, where T is the transformation matrix. The SVD procedure provides a different type of decom- 
positions. Within the SVD framework A is regarded as a sequence of 3 operations: a generalized "rotation" followed 
by stretching/squeezing, and another generalized "rotation". Namely: 

A - - U v/diag{p,} W (67) 

r 

Here the positive numbers Pr are called singular values, and Ur and Vr are not dual bases but unrelated orthonormal 
sets. The corresponding unitary transformation matrices are U and V. 



[3.7] Notational conventions 



In Mathematica there is a clear distinction between dummy indexes and fixed values. For example f{x_) = 8 means 
that f{x) = 8 for any x, hence a; is a dummy index. But if a; = 4 then f{x) = 8 means that only one element of the 
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vector f{x) is specified. Unfortunately in the printed mathematical literature there are no clear conventions. How 
ever the tradition is to use notations such as f{x) and /(x') where x and x' are dummy indexes, while /(xg) and 
f{xi) where xq and xi are fixed values. Thus 



A- r ^ 



(68) 



A. 



for io = 2 and jo = 1 



Another typical example is 



Tx.k 



{x\k) 

(a;|fco) 

{x\k) 



(69) 



In the first equality we regard {x\k) as a matrix: it is the transformation matrix form the position to the momentum 
basis. In the second equality we regard the same object (with fixed fcg) as a state vector. In the third equality we 
define a set of " wavefunctions" . 

We shall keep the following extra convention: representation indexes are always lower indexes. The upper indexes are 
reserved for specification. For example 



Y^''"'{9,ip) = {9,ip\£m) = spherical harmonics 

<p"(x) = {x\n) = harmonic oscillator eigenfunctions 



(70) 



Sometime it is convenient to use the Einstein summation convention, where summation over repeated dummy indexes 
is implicit. For example: 



f{0,^) = Y.{0,^\em){em\f) = /„„r^™(0,^) 



(71) 



em 



In any case of ambiguity it is best to translate everything into Dirac notations. 



[3.8] The separation of variables theorem 



Assume that the operator Ti. commutes with an Hermitian operator A. It follows that if |a, ly) is a basis in which A 
is diagonalized, then the operator Ti. is block diagonal in that basis: 



{a,iy\A\a' = aSaa'S, 
{a,iy\n\a',:^') = Saa'H 



(a) 



(72) 
(73) 



Where the top index indicates which is the block that belongs to the eigenvalue a. 
To make the notations clear consider the following example: 



/2 0\ 
'02000' 

9 
9 

Vo 9/ 



H 



/5 3 0\ 

3 6 

4 2 8 

2 5 9 

Vo 8 9 7/ 



5 3 
3 6 




(74) 
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Proof: [n, A] = (75) 

{a,v\HA- An\a = 
a' (a, v\U\a', v') - a{a, iy\H\a' , v') = 

(a - a')Hay,a'y' = 

a ^ a' Hav.a'v' = 



It follows that there is a basis in which both A and 7i are diagonalizcd. This is because we can diagonalize the 
matrix H block by block (the diagonalizing of a specific block does not affect the rest of the matrix). 

The best know examples for "separation of variables" are for the Hamiltonian of a particle in a centrally symmetric 
field in 2D and in 3D. In the first case is constant of motion while in the second case both [l? ^ Lz) are constants 
of motion. The full Hamiltonian and its blocks in the first case are: 



n = lp^ + V{r) - l(^pl + lLlj+v{r) (76) 
= lpl + ^ + V(r) -herep?^-i|-(r|-) (77) 



The full Hamiltonian and its blocks in the second case are: 



n = ^p^ + V{r) = ^^(^pl + l^L'j+V{r) (78) 

^iim) ^ y +^_l±^ + vir) where p?--i^r (79) 

2 2r^ r or'' 



In both cases we have assumed units such that m = 1. 



[4] A particle in an N site system 
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[4.1] N Site System 



A site is a location where a particle can be located. If we have N = 5 sites it means that we have a 5-dimensional 
Hilbert space of quantum states. Later we shall assume that the particle can "jump" between sites. For mathematical 
reasons it is conveneint to assume torus topology. This means that the next site after a; = 5 is a; = 1. This is also 
called periodic boundary conditions. 

The standard basis is the position basis. For example: \x) with x = 1,2, 3, 4, 5 [mod 5]. So we can define the position 
operator as follows: 



x\x} = x\x) 



(80) 



In this example we get: 



/I 0\ 
'02000' 
3 
4 
Vo 5/ 



(81) 



The operation of this operator on a state vector is for example: 



1^) =7|3)+5|2) 
x|V') = 21|3) + 10|2) 



(82) 



[4.2] Translation Operators 



The one-step translation operator is defined as follows: 



D\x) = \x + l) 



(83) 



For example: 



D 



/O 1\ 

' 1 ' 

10 

10 

Vo 1 o/ 



(84) 



and hence D\l) = |2) and D\2) = |3) and £>|5) = |1). Let us consider the superposition: 



|V') = ^[|1) + |2) + |3) + |4) + |5)] 



(85) 



It is clear that D\'ip) = This means that -0 is an eigenstate of the translation operator (with eigenvalue e'°). The 
translation operator has other eigenstates that we will discuss in the next section. 
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[4.3] Momentum States 



The momentum states are defined as follows: 

\k) ^ -^e''^- (86) 



2tt 

k = 'j^i^i " ~ integer mod (N) 



In the previous section we have encountered the fc = momentum state. In Dirac notation this is written as: 



or equivalently as 
{x\k) = -^=e 



ikx 



While in old fashioned notation it is written as: 

i,l = {x\k) (89) 

Where the upper index k identifies the state, and the lower index x is the representation index. Note that if x were 
continuous then it would be written as tp''{x). 

The k states are eigcnstatcs of the translation operator. This can be proved as follows: 

X X ^ x' ^ x' ^ 

Hence we get the result: 

D\k)=e-'^\k) (91) 

and conclude that |fc) is an eigenstate of D with an eigenvalue e"'*^. Note that the number of independent eigenstates 
is TV. For exmaple for a 5-site system we have e'*''' = e*'''^ . 



== [4.4] Momentum Operator 

The momentum operator is defined as: p\k) = k\k) From the relation D\k) = e~**''|fc) it follows that D\k) — c^^^\k). 
Therefore we get the operator identity: 

b = e-'P (92) 

We can also define 2-step, 3-step, and r-step translation operators as follows: 



I)(2) = (£))2 = e-'^P 
b{3) = (bf = e-'^P 
b{r) = {by = e-'^'P 



(93) 
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[5] The continuum limit 



[5.1] Definition of the Wave Function 



Below we will consider a site system in the continuum limit, e — > is the distance between the sites, and L is the length 
of the system. So, the number of sites is iV = L/e ^ oo. The eigenvalues of the position operator are = e x integer 
We use the following recipe for changing a sum into an integral: 



The definition of the position operator is: 

x\xi) = Xi\xi) (95) 
The representation of a quantum state is: 

\^P) = = J dx^{x)\x) (96) 

i 

where we define 

i^(x^) = (97) 



\x) = —p\xi) [infinite norm!] (98) 



The Dirac delta function is defined as (5(0) = 1/e and zero otherwise. We have orthonormality in a generalized sense, 
namely {x\x') = 5{x'~x'). Note that we still have ^{x) = (a;|'0), and the normalization of the "wave function" is: 



[5.2] Momentum States 



The definition of the momentum states using this normalization convention is: 

^^^{x) = -^c'^'' (100) 
yL 

Where the eigenvalues are: 
27r 

k ~ — X mteger (101) 
L 
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We use the following recipe for changing a sum into an integral: 

We can verify the orthogonality of the momentum states: 

(fcalfci) = Y,{k2\x){x\h) = ^ V'^V^ = / dxyj'^^ixfyj'^^ix) = ^ f dxe*('=^-'=^)- = 4,,^^ (103) 

XX'' •' 

The transformation from the position basis to the momentum basis is: 

^-fc = {k\^p) = ^(fc|x)(x|V') = / i}''ix)*'ilj{x)dx = ^ / Hx)e~'^''dx (104) 



VI. 

For convenience we will define: 

'J'(fc) = Vl^u (105) 
Now we can write the above relation as a Fourier transform: 

■^{k) = j il}{x)e-''''' dx (106) 

Or. in the reverse direction: 

/dk 
-^{ky'^^ (107) 

== [5.3] Translations 

We define the translation operator: 

D{a)\x) ^ \x + a) (108) 
If \'ip) is represented by ^{x) then D[a)\il)) is represented by ij}{x — a). In Dirac notation we may write: 

{x\D{a)\'ip) ^ {x - a\i;) (109) 

This can obviously be proved easily by operating on the "bra". However, for pedagogical reasons we will also 
present a longer proof: Given 

\i,) = Y,^x\x) (110) 

Then 

D(a)l^) =^^.(a;)|a; + a) =^7^(a;'-a)|x') =5]^(x-a)|x> (111) 
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[5.4] The Momentum Operator 



The momentum states are eigenstates of the translation operators: 

15(0)1^) = e^^'^lfc) (112) 
The momentum operator is defined the same as in the discrete case: 

p\k)^k\k) (113) 
Therefore the following operator identity emerges: 

b{a) = e-"^P (114) 
For an infinitesimal translation: 

D{5a) = 1 - i5ap (115) 
We see that the momentum operator is the generator of the translations. 



[5.5] The differential representation 



In the continuum limit the operation of p can be realized by a differential operator. We have already proved the 
identity: 

{x\D{a)\^) = {x - a\il:) (116) 
Therefore for an infinitesimal translation we have: 

DiSaMij}) ^ - 5a) = Hx) - Sa-^ipix) (117) 

ax 

From here it follows that 

(x|p|V^) (118) 
ax 

In other words, we have proved the following statement: The operation of p on a wavefunction is realized by the 
differential operator ^i{d/dx). 

== [5.6] Algebraic characterization of translations 

If |.t) is an eigenstate of x with eigenvalue x, then D\x) is an eigenstate of x with eigenvalue x + a. In Dirac notations: 
x{D\x)) ^ {x + a){D\x)) for any X (119) 

Which is equivalent to: 

x{D\x)) ^ D{{x + a)\x)) for any X (120) 
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Therefore the following operator identity is implied: 

X D = D{x + a) (121) 
Which can also be written as 

[x,D]^aD (122) 

The opposite is correct too: if an operator D fulfills the above relation with another operator x, then the former is a 
translation operator with respect to the latter, where a is the translation distance. 

The above characterization applies to any type of translation operators, include "raising/lowering" operators which 
are not necessarily unitary. A nicer variation of the algebraic relation that characterizes a translation operator is 
obtained if D is unitary: 

D~^xD^x + a (123) 

If we write the infinitesimal version of this operator relation, by substituting D{Sa) = 1 — iSap and expanding to the 
first order, then we get the following commutation relation: 

[x,p]=i (124) 

The commutation relations allow us to understand the operation of operators without having to actually use them on 
wave functions. 



[5.7] Vector Operators 



Up to now we have discussed the representation of a a particle which is confined to move in a one dimensional 
geometry. The generalization to a system with three geometrical dimensions is straightforward. 

\x,y,z) = \x)®\y)®\z) (125) 

x\x,y,z) = x\x,y,z) 

y\x,y,z) = y\x,y,z) 

z\x,y,z) = z\x,y,z) 

We define a "vector operator" which is actually a "package" of three operators: 

r={x,y,z) (126) 

And similarly: 

p = {Px,Py,Pz) (127) 

A = [Ax, Ay, A,) 
Sometimes an operator is defined as a function of other operators: 

A = A(f) = {Ax{x,y,z),Ay{x,y,z),A^{x,y,z)) (128) 
For example A = f/|f p. We also note that the following notation is commonly used: 

=-P--P=pI+pI+pI (129) 
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[5.8] The Translation Operator in 3-D 



The translation operator in 3-D is defined as: 

D{a)\r) = |r + a) (130) 

An infinitesimal translation can be written as: 

£){Sa) = c-'S''-P-c-'^^yP^c-'^''^P' (131) 
= 1 — iSa^Px — iSttyPy — iSttzPz = 1 — iSa ■ p 

The matrix elements of the translation operator are: 

(r|I?(a)|r') =(53(r-(r'+a)) (132) 



[5.9] The Matrix Elements of the Momentum Operator 



In one dimension, the matrix elements of the translation operator are: 

{x\D{a)\x') ^ S{{x ~ x') - a) (133) 
For an infinitesimal translation we write: 

(a;|(l — idap)\x') = S{x — x') — dad'{x — x) (134) 
So that we get: 

{x\p\x') = -i5'{x^x') (135) 

We notice that the delta function is symmetric, so its derivative is anti-symmetric. In analogy to multiplying a 
matrix with a column vector we write: Aj^I') i-^ Aij'^ j. Let us examine how the momentum opertor operates on 
a " wavefunction" : 

p\^) ^ ^p,,.*,. =y"(a;|p|a;')«'(a;')rf2:'= (136) 

x' 

f d d 

= ~i S(x' - x)^—'^(x')dx' -i— *(x) 
J ox' ox 

Therefore: 

d 

p\^) ^ -i—'^ix) (137) 
The generalization of the previous section to three dimensions is straightforward: 

^ (-4*, -4^, -4*) (138) 
or in simpler noation p\'if) i-^ — iV'I'. We also notice that p^l^*) i-^ — V^Vl'. 
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[6] Rotations 



== [6.1] Euclidean Rotation Matrix 

The Euclidean Rotation Matrix R^{^) is a 3 x 3 matrix that rotates the vector r. 



f ROTATION\ 



y^,]={ MATRIX )\y\ (139) 

The Euchdean matrices constitute a representation of dimension 3 of the rotation group. The parametrization of a 
rotation is done using three numbers which are kept in a vector: $. The three parameters are: Two parameters of 
the axis of rotation: 6*, Lp. How much to rotate (the length of the vector): $. 

= (sin 61 cos 0, sin sin 0, cosO) (140) 

A 3 X 3 small angle rotation of r can be written as: 

i?^((5$)r r + (5$ X r (141) 



== [6.2] The Rotation Operator Over the Hilbert Space 

The rotation operator over the Hilbert space is defined (in analogy to the translation operator) as: 

i?($)|r) EE |i?^($)r) (142) 

This operator operates over an infinite dimension Hilbert space (the standard basis is an infinite number of " sites" in 
the three-dimensional physical space). Therefore, it is represented by an infinite dimension matrix: 

R,,, = {r'\R\r) = {r'\R^r) = S{r' - R^r) (143) 

That is in direct analogy to the translation operator which is represented by the matrix: 

Dr'r = {r'\D\r) = {r'\r + a) = 5{r' - {r + a)) (144) 

As discussed in a previous section if a wavefunction i/'(r) is translated by D(a) then it becomes Tp{r — a). In complete 
analogy, if the wavefunction is rotated by R{^), then it becomes il}{R^{—^)r). 



^^^^= [6.3] Which Operator is the Generator of Rotations? 

The generator of rotations (the " angular momentum operator" ) is defined in analogy to the definition of the generator 
of translations (the "linear momentum operator"). In order to define the generator of rotations around the axis n we 
will look at an infinitesimal rotation of an angle 6^n. An infinitesimal rotation is written as: 

R{6<S>n) = 1 - i5$i„ (145) 

Below we will prove that the generator of rotations around the axis n is: 

L„ = n • (r X p) (146) 
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Where: 



f = {x,y,z) (147) 

P = {Px,Py,Pz) 



Proof: We shall show that both sides of the equation give the same result if they operate on any basis state |r). This 
means that we have an operator identity. 

R{5$)\r) = \R^{6h)r) ^ \r + 5$ x r) ^ D{SS x r)|r) (148) 
= [i - i{S$ X r) • p]|r) = [1 - ip • (5$ x r]|r) = [1 - ip ■ 5$ x r]|r) 

So we get the following operator identity: 

R{S^) = i - ip • (5$ X f (149) 
Which can also be written (by exploiting the cyclic property of the triple vectorial multiplication) : 

R{5$) = 1 - i6$ ■ {r X p) (150) 

From here we get the desired result. Note: The more common procedure to derive this identity is based on expanding 
the rotated wavefunction tp(R^ {—d^)r) = 'ipir — (5$ x r), and exploiting the association p i— > —iV. 

== [6.4] Algebraic characterization of rotations 

A unitary operator D realizes a translation in the basis which is determined by an observable x if we have the equality 

I)\x) = \x + a) for any x (151) 

This means that D\x) is an eigenstate of x with an eigenvalue x + a, which can be written as ^[ZJla:)] = (x + a)[i5|x)], 
or as xD\x) = D{x + a)\x). Therefore an equivalent way to write the defining condition of a translation operator is 

xD = D{x + a) (152) 

or 

D-^xD^x + a (153) 

By considering an infinitesimal translation we get another way of writing the same thing: 

[p,x] = -t (154) 

In complete analogy, a unitary operator R realizes rotation $ in the basis which is determined by an observable x. If 
we have the equality 

R\r) = \R^r) for any r (155) 
where R^ is the Euclidean rotation matrix. This can be written as 

R~^f,R = Rf^fj (156) 
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(with implicit summation over j). By considering an infinitesimal rotation we get another way of writing the same 
thing: 

[Jj,ri] = -ieijkVk (157) 

Thus in order to know if J generates rotations of eigenstates of a 3-component observable A, we have to check if the 
following algebraic relation is fulfilled: 

[J„Aj]^ie,jkAk (158) 

(for convenience we have interchanged the order of indices). 

== [6.5] Scalars, Vectors, and Tensor Operators 



We can classify operators according to the way that they transform under rotations. The simplest possibility is a 
scalar operator C. It has the defining property 

R-^CR = C (159) 
for any rotation, which means that 

[J^,C]^0 (160) 
Similarly the defining property of a vector is 

R-^A,R ^ RfjAj (161) 
for any rotation, which means that 

[Ji,Aj] = ieijkAk (162) 

The generalization of this idea leads to the notion of a tensor. A multicomponent observer is a tensor of rank £, if it 
transforms according to the Rfj representation of rotations. Hence a tensor of rank £ should have 2£ + l components. 
In the special case of a 3-component "vector", as discussed above, the transformation is done using the Euclidean 
matrices Rfl. 

It is easy to prove that if A and B are vector operators, then C = A ■ B is a. scalar operator. We can prove it either 
directly, or by using the commutation relations. The generalization of this idea to tensors leads to the notion of 
" contraction of indices" . 

== [6.6] Wigner-Eckart Theorem 

If we know the transformation properties of an operator, it has implications on its matrix elements. In the case of a 
scalar the operator C should be diagonal in the basis \j, m): 

Cm'm — cSm'm within a givcn J irrcduciblc subspacc (163) 

else it would follow from the "separation of variables theorem" that all the generators ( Ji) are block-diagonal in the 
same basis. Note that within the pre-specified subspace we can write c = (C), where the expectation value can be 
taken with any state. A similar theorem applies to a vector operator A. Namely, 

[^fc]m'm = 3 X [Jk]m'm within a given j irreducible subspace (164) 
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How can we determine the coefficient g? We simply observe that from the last equation it follows that 

[A-J]m'm = ff[J^]m'm = gjij + '^)Sm'm (165) 

in agreement with what we had claimed regarding scalars in general. Therefore we get the formula 



where the expectation value of the scalar can be calculated with any state. 

The direct proof of the Wigner-Eckart theorem, as e.g. in Cohen- Tannoudji, is extremely lengthy. Here we propose 
a very short proof that can be regarded as a variation on what we call the " separation of variable theorem" . From 
[Ax, Jx] = we deduce that A^ is diagonal in the Jx basis, so we can write this relation as Ax = f{Jx)- The rotational 
invariance implies that the same function /() related Ay to Jy and A^ to Jz- Next we realize that for a vector operator 
[Jz, A+] = A+ where = Ax + iAy. It follows that A+ is a raising operator in the Jz basis, and therefore must 
be expressible as = g(Jz)[J^ +iJy], where g{) is some function. It is clear that the only way to satisfy the 
equality f{Jx) + if{Jy) = 9{Jz)[Jx + ^Jy], is to have f{X) = gX and g{X) = g, where g is a constant. Hence the 
Wigner-Eckart theorem is proved. 
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Fundamentals (part II) 

[7] Quantum states / EPR / Bell / postulates 
7.1] Is the world classical? (EPR, Bell) 



We would like to examine whether the world we live in is "classical" or not. The notion of classical world includes 
mainly two ingredients: (i) realism (ii) determinism. By realism wc means that any quantity that can be measured 
is well defined even if we do not measure it in practice. By determinism we mean that the result of a measurement 
is determined in a definite way by the state of the system and by the measurement setup. We shall see later that 
quantum mechanics is not classical in both respects: In the case of spin 1/2 we cannot associate a definite value of 
Gy for a spin which has been polarized in the Gx direction. Moreover, if we measure the CTy of a polarized spin, we 
get with equal probability ±1 as the result. 

In this section we would like to assume that our world is " classical" . Also we would like to assume that interactions 
cannot travel faster than light. In some textbooks the latter is called "locality of the interactions" or "causality". It 
has been found by Bell that the two assumptions lead to an inequality that can be tested experimentally. It turns 
out from actual experiments that Bell's inequality are violated. This means that our world is either non-classical or 
else we have to assume that interactions can travel faster than light. 

If the world is classical it follows that for any set of initial conditions a given measurement would yield a definite 
result. Whether or not we know how to predict or calculate the outcome of a possible measurement is not assumed. 
To be specific let us consider a particle of zero spin, which disintegrates into two particles going in opposite directions, 
each with spin 1/2. Let us assume that each spin is described by a set of state variables. 

state of particle A = x^,a::^,... (167) 
state of particle B = 2;;f,a;^,... 

The number of state variables might be very big, but it is assumed to be a finite set. Possibly we are not aware or 
not able to measure some of these "hidden" variables. 

Since we possibly do not have total control over the disintegration, the emerging state of the two particles is described 
by a joint probabihty function p (xf , xf , ...) . We assume that the particles do not affect each other after the 
disintegration ( "causality" assumption). We measure the spin of each of the particles using a Stern-Gerlach apparatus. 
The measurement can yield either 1 or —1. For the first particle the measurement outcome will be denoted as a, 
and for the second particle it will be denoted as h. It is assumed that the outcomes a and b are determined in a 
deterministic fashion. Namely, given the state variables of the particle and the orientation 9 of the apparatus wc have 

a = a{OA)^ f{BA,xtxt-)^±^ (168) 
h = 6(0B) = /(0B,a;f,rEf,...)=±l 

where the function /() is possibly very complicated. If we put the Stern-Gerlach machine in a different orientation 
then we will get different results: 

a' = a{6'A) = / {e'A,xt xt •■•) = ±1 (169) 

b' = b{e'B) = .f{e's,xtxi...)=±i 

We have following innocent identity: 

ab + ab' + ab-ab' ^±2 (170) 

The proof is as follows: if 6 = 6' the sum is ±2a, while if 6 = —6' the sum is ±2a'. Though this identity looks innocent, 
it is completely non trivial. It assumes both "reality" and "causality" This becomes more manifest if wc write this 
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identity as 

ai9A)b{0B) + a{9A)bi9'B) + aie'^Ws) - aiO'^We) = ±2 (171) 

The realism is reflected by the assumption that both a{9A) and a{9'^) have definite values, though it is clear that in 
practice we can measure either u^Oa) or a{0'^), but not both. The causality is reflected by assuming that a depends 
on 9a but not on the distant setup parameter 9b- 

Let us assume that we have conducted this experiment many times. Since we have a joint probability distribution p, 
we can calculate average values, for instance: 

(ab) = J p {xt xf , ...) / {9a, xt ...) / (93, xf, ...) (172) 

Thus we get that the following inequality should hold: 

I (ab) + {ab') + {a'b) - {a'b') \ < 2 (173) 

This is called Bell's inequality. Let us see whether it is consistent with quantum mechanics. Wc assume that all the 
pairs are generated in a singlet (zero angular momentum) state. It is not difficult to calculate the expectation values. 
The result is 

{ab) = -cos{9a-9b) = C{9a-9b) (174) 
we have for example 

C(0°) = -1 (175) 

C{90°) = 
C(180°) = +1 

If the world were classical the Bell's inequality would imply 

\C{9a - 9b) + C{eA - 9's) + ae'^ - 9b) + C{9'a - 9's)\ < 2 (176) 
Let us take 9a = 0° and 9b = 45° and 9'j^ = 90° and 9'^ = —45°. Assuming that quantum mechanics holds we get 



7l) + {-7ll + (-71) - (+71 



^ 2V2 > 2 (177) 



It turns out, on the basis of celebrated experiments that Nature has chosen to violate Bell's inequality. Furthermore 
it seems that the results of the experiments arc consistent with the predictions of quantum mechanics. Assuming that 
we do not want to admit that interactions can travel faster than light it follows that our world is not classical. 
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== [7.2] The notion of quantum state 

A-priory we can classify the possible "statistical states" of a prepared system as follows: 

• Classical state: any measurement gives a definite value. 

• Pure state: there is a complete set of measurements that give definite value, while any other measurement gives 
an uncertain value. 

• Mixture: it is not possible to find a complete set of measurements that give a definite value. 

When we go to Nature we find that classical states do not exist. The best we can get are "pure states" . For example the 
best we can have with the spin of an electron is 100% polarization (say) in the X direction, but then any measurement 
in any different direction gives an uncertain result, except the —X direction which we call the "orthogonal" direction. 
Consequently we are inclined to postulate that polarization (say) in the non-orthogonal Z direction is a superposition 
of the orthogonal X and —X states. Similarly with photons we are inclined to postulate that linear polarization 
in the 45° direction is a superposition of the orthogonal X polarization and Y polarization states. With the same 
reasoning, and on the basis of the "two slit experiment" phenomenology, we postulate that a particle can be in a 
superposition state of two different locations. The only subtlety here is that superposition of different locations is 
not another location but rather (say) a momentum state, while superposition of different polarizations states is still 
another polarization state. Note however that also with higher spins (say spin 1), the superposition of polarized states 
can give a different type of polarization (linear / circular / elliptic). 

Having postulated that all possible pure states can be regarded as forming an Hilbert space, it still does not help us to 
define the notion of quantum state in the statistical sense. We need a second postulate that would imply the following: 
If a full set of measurements is performed (in the statistical sense) , then one should be able to predict (in the statistical 
sense) the result of any other measurement. For example, in the case of spins 1/2, say that I measure the average 
polarization Mi in the i = X,Y,Z directions. Can one predict the result for Mn, where n is a unit vector pointing 
in an arbitrary direction. According to the second postulate of quantum mechanics (see next section) the answer is 
positive. Indeed experiments reveal that M„ = n ■ M . Taking together the above two postulates, our objective would 
be to derive and predict such linear relations from our conception of Hilbert space. In the spin 1/2 example we would 
like to view Mn = n ■ M as arising from the dim=2 representation of the rotation group. Furthermore, we would like 
to derive more complicated relations that would apply to other representations (higher spins). 

== [7.3] The four Postulates of Quantum Mechanics 

The 18th century version classical mechanics can be derived from three postulates: The three laws of Newton. The 
better formulated 19th century version of classical mechanics can be derived from three postulates: (1) The state 
of classical particles is determined by the specification of their positions and its velocities; (2) The trajectories are 
determined by a minimum action principle. (3) The form of the Lagrangian of the theory is determined by symmetry 
considerations, namely Galilei invariance in the non-relativistic case. See the Mechanics book of Landau and Lifshitz 
for details. 

Quantum mechanically requires four postulates: Two postulates define the notion of quantum state, while the other 
two postulates, in analogy with classical mechanics, are about the laws that govern the evolution of quantum me- 
chanical systems. The four postulates are: 

(1) The collection of "pure" states is a linear space (Hilbert). 

(2) The expectation values of obscrvablcs obey linearity: {aX + (3Y) = ci{X) + (3(Y) 

(3) The evolution in time obey the superposition principle: a|^'*') + /3|$") a\^^) -I- 

(4) The dynamics of a system is invariant under specific transformations ("gauge", "Galilei"). 

The first postulates refer to "pure states". These are states that have been filtered. The filtering is called "prepara- 
tion". For example: we take a beam of electrons. Without "filtering" the beam is not polarized. If we measure the 
spin we will find (in any orientation of the measurement apparatus) that the polarization is zero. On the other hand. 
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if we "filter" the beam (e.g. in the left direction) then there is a direction for which we will get a definite result (in 
the above example, in the right/left direction). In that case we say that there is full polarization - a pure state. The 
"uncertainty principle" tells us that if in a specific measurement we get a definite result (in the above example, in 
the right/left direction), then there are different measurements (in the above example, in the up/down direction) for 
which the result is uncertain. The uncertainty principle is implied by postulate [1] . 

The second postulate use the notion of " expectation value" that refers to " quantum measurement" . In contrast with 
classical mechanics, the measurement has meaning only in a statistical sense. We measure "states" in the following 
way: we prepare a collection of systems that were all prepared in the same way. We make the measurement on all the 
" copies" . The outcome of the measurement is an event x = x that can be characterized by a distribution function. 
The single event can show that a particular outcome has a non-zero probability, but cannot provide full information 
on the state of the system. For example, if we measured the spin of a single electron and get dz = 1, it does not mean 
that the state is polarized " up" . In order to know if the electron is polarized we must measure a large number of 
electrons that were prepared in an identical way. If only 50% of the events give dz = 1 we should conclude that there 
is no definite polarization in the direction we measured! 



[7.4] Observables as random variables 



Observable is a random variable that can have upon measurement a real numerical value. In other words x = x 
is an event. Let's assume, for example, that we have a particle that can be in one of five sites: x = 1, 2, 3, 4, 5. An 
experimenter could measure Prob(x = 3) or Prob(p = 3(27r/5)). Another example is a measurement of the probability 
Prob(cf2 = 1) that the particle will have spin up. 

The collection of values of x is called the spectrum of values of the observable. We make the distinction between 
random variables with a discrete spectrum, and random variables with a continuous spectrum. The probability 
function for a random variable with a discrete spectrum is defined as: 

fix) = Prob(i = x) (178) 
The probability density function for a random variable with a continuous spectrum is defined as: 

f{x)dx = Prob(a; < x < x + dx) (179) 
The expectation value of a variable is defined as: 

(x) = ^/(a;)x (180) 

X 

where the sum should be understood as an integral J dx in the case the x has a continuous spectrum. Of particular 
importance is the random variable 

-P" = 4,. (181) 
This random variable equals 1 ii x = x and zero otherwise. It expectation value is the probability to get 1, namely 

fix) = {Pn (182) 
Note that x can be expressed as the linear combination xP^. 
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[7.5] Observables as operators 







In the quantum mechanical treatment we regard the observable x as an operator. Namely we define its operation on 
the basis states as S:\x) = x\x), and by linearity its operation is defined on any other state. In particular if follows 
that arc projectors. For example 



(183) 



Thus the first postulate of Quantum Mechanics implies that with any observable we can associate an Hermitian 

operator that belongs to an A^^-dimensional space of operators. The linear relations between different basis sets 
translate into a linear relations between operators. Accordingly we can span the whole space of observables by any 

set of iV^ independent operators P^. These may but do not have to be projectors: for example in the dim = 2 case 
it is convenient to take {1, (Ti, (T2, 173} as the basis set. 

From the second postulate of Quantum mechanics it follows that if A — arP^ then 

(A) = J2 ""rPr (184) 

r 

where the set of N'^ expectation values pr = {P^) fully characterizes the quantum state. In the dim = 2 case it is 
implied that any expectation value can be calculated using a linear relation of the form (A) = oq + a • M, where 
M = (((Ti), (fT2), (cts)) is the polarization vector. More generally we can create a package p = {pr} which we call 
probability matrix. The term "matrix" is used because in general the label r that distinguishes the N'^ basis operators 
is composed of two indexes. The definition of p is further discussed in the following section. First we recall the 
definition of the corresponding object in classical statistical mechanics, and then we turn to the quantum mechanical 
definition. 



7.6] Quantum Versus Statistical Mechanics 



Quantum mechanics stands opposite classical statistical mechanics. A particle is described in classical statistical 
mechanics by a probability function: 

p(x,p)dxdp = 'PYQh{x < X < X + dx,p < p < p + dp) (185) 
Optionally this definition can be expressed as the expectation value of a phase space projector 

p{x,p) = {S{x-x)S{p-p)) (186) 
The expectation value of a random variable A = A{x,p) is calculated using the definition: 

(A) ^ / A{x,p)p{x,p)dxdp (187) 



From this follows the linear relation: 

{aA + I3B) ^ a{A) + I3{B) (188) 

We see that the linear relation of the expectation values is a trivial result of classical probability theory. It assumes 
that a joint probability function can be defined. But in quantum mechanics we cannot define a "quantum state" 
using a joint probability function, as implied by the observation that our world is not "classical" . For example we 
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cannot have both the location and the momentum we defined simultaneously. For this reason, we have to use a more 
sophisticated definition of p. The more sophisticated definition is based on regarding p as the expectation value of 
projector, and taking the linearity of the expectation value as a postulate. 



== [7.7] Definition of the probability matrix 

The definition of p in quantum mechanics is based on the trivial observation that and observable A can be written as a 
linear combination of N'^ — 1 independent projectors. If we make N"^ — 1 independent measurements over a complete 
set of projectors, then we can predict the result of any other measurement. The possibility to make a prediction is 
based on taking the linearity of the expectation value as a postulate. The above statement is explained below, but 
the best is to consider the N = 2 example that comes later. 

Any Hermitian operator can be written as a combination of iV^ operators as follows: 

A = J2\^){^\A\J){J\^Y.'^^^P'' (189) 

Where P-'* = \i){j\. We notice that the = P" = are elementary projectors on the basis states. They fulfill 
the relation P' = 1- The rest of the operators can be written as P'-' = X + lY . Note that the adjoint opera- 
tors are P-** = X — lY . So for each combination of ij we have two hermitian operators X and Y . We can write 
X = 2P^ — P' — P^ , and Y = 2P^ — P' — P^ where P^ and P^ are elementary projectors. Thus we have established 
that the operator A is a combination of the N + 2[iV(iV— 1)/2] = iV^ projectors {P*, P^, P^} with one constraint. If 
we make A^" — 1 independent measurements of these projectors we can predict the result of any other measurement 
according to the equation: 

{A) = ^uPt^ = trace(Ap) (190) 

Where p is the probability matrix. Each entry in the probability matrix is a linear combination of expectation values 
of projectors. Note that the expectation value of a projector P = is the probability to find the systems in the 

state \tp). 

^^^^= [7.8] Example: the quantum state of spin | 

We will look at a two-site system, and write the matrix: ^ in the following way: 

We may write the basis of this space in a more convenient form. For this reason we will define the Pauli matrices: 
We note that these matrices are all Hermitian. 

Any operator can be written as a linear combination of the Pauli matrices: 

A = ci + acjx + Pay + (193) 

If the operator A is Hermitian then the coefficients of the combination are real. We see that in order to determine 
the quantum state of spin i we must make three independent measurements, say of (Tx,y.z- Then we can predict the 
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result of any other measurement by: 



(A) =c + a(a,)+/3(ay) +7(a,) 



(194) 



One way of "packaging" the 3 independent measurements is the polarization vector: 



A/=((a.),(a,),(a,)) 

But the standard "package" is the probability matrix whose elements are the expectation values of: 



(195) 



pTT = 
pll — 
pit = 
= 



T)(T 
iXi 
TXi 
i)(T 



1 





1 

1 





1 



= -(l + a,) = 
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i(cr.^ - iay) 



1 - P^ 



i(2P^ 
2 

i(2P^ 



1) + -(2P^-1) 
1)-^(2P^-1) 



We get the following relation between the two types of " packages" : 



(197) 



[7.9] Pure states as opposed to mixed states 



After diagonalization, the probability matrix can be written as: 



(Pi 

P2 
P3 



V 



(198) 



The convention is to order the diagonal elements in descending order. Using the common jargon we say that the state 
represented by p is a mixture of |1), |2), |3), . . . with weights pi,p2,P3, • ■ ■■ The most well known mixed state is the 
canonical state: 



(199) 



Where j3 =- ksT. A "pure state" is the special case where the probability matrix after diagonalization is of the form: 



/I 





(200) 
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This may be written in a more compact way as p = |1)(1| = {ip) ('01 = P^- Note that {P'^) = 1. This means a definite 
outcome for a measurement that is aimed in checking whether the particle is in state " 1" . That is why we say that 
the state is pure. 

== [7.10] Various versions of the expectation value formula 



[1] The standard version of the expectation value formula: 

(A) = tr{Ap) (201) 
[2] The "mixture" formula: 

(A) =Y,p.,.{r\A\r) (202) 

r 

[3] The "sandwich" formula: 

{A}^ = {^\A\i,) (203) 
[4] The "projection" formula: 

ProM^IV-) = !(</>! (204) 

The equivalence of statements 1-4 can be proved. In particular let us see how we go from the fourth statement to the 
third: 

(A).^, = J2 P^-oh{a\i')a = J2 I («l^> l'« = (^l^l'A) (205) 
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[8] The evolution of quantum mechanical states 
== [8.1] The Evolution Operator and the Hamiltonian 



We will discuss a particle in site If we multiply the basis vector by a constant, for example —8, we will get a new 
basis: |1) ~ ~8|1) which isn't normalized and therefore not convenient to work with. Explanation: if we represent 
the state lip) as a linear combination of normalized basis vectors = J^j i-'jlj)- then we can find the coefficients of 
the combination by using the following formula: ipi = {i\ip)- 

Even if we decide to work with "normalized" states, there is still a some freedom left which is called "gauge freedom" 
or "phase freedom". We will consider the state | t) and the state et'| 1). For these states p is the same: Multiplying 
a vector-state with a phase factor does not change any physical expectation value. 

From the superposition principle and what was said above regarding the normalization, it follows that the evolution 
in quantum mechanics will be described by a unitary operator. 

1^-*=°) ^ 1^*) (206) 
1^*) = f/|V*=°) 

In order to simplify the discussion we will assume that the environmental conditions arc constant (constant fields in 
time). In such a case, the evolution operator must fulfill: 

Uit2+h)^U{t2)U{h) (207) 
It follows that the evolution operator can be written as 

U(t) = e"'*^ (208) 
Where H is called the Hamiltonian or "generator" of the evolution. 

Proof: The "constructive" way of proving the last formula is as follows: In order to know the evolution of a system 
from ti to t2 we divide the time interval into many small intervals of equal size dt ~ (t2 — ti)/N. This means that: 

U{t2, h) = C/(t2, t2-dt) - ■ ■ U{ti + 2dt, ti + dt)U{ti + dt, ti) (209) 

The evolution during an infinitesimal time interval can be written as: 

U{dt) = i - idm = e-"^*'^ (210) 

In other words, the Hamiltonian is the evolution per unit of time. Or we may say that Ti. is the derivative of U with 
respect to time. By multiplying many infinitesimal time steps we get: 

t/ = (1 - idtn) ■■■{!- idm){l - idm) = e-"^*^ • • • e-^dm^-^dtn ^ ^~itn (211) 

Where we have assumed that the Hamiltonian does not change in time, so that the multiplication of exponents can 
be changed into a single exponent with a sum of powers. We remember that that this is actually the definition of the 
exponential function in mathematics: exp(t) = (1 + t/N)^ . 



[8.2] The Schrodinger Equation 



Consider the evolution of a pure state: 

■4)'+'^^ = {I- idtn)i;* (212) 
-Ml 
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This is the Schrodingcr equation. For a general mixture 

P = El^)^''-(H (213) 

r 

we have 

|r) ^ U\r), {r\ (r|f7^ (214) 
Therfore the evolution of p in time is: 

Pt = Upt^^U^ (215) 

This is Liouville Von-Neumann equation. One of its advantages is that the correspondence between the formalism of 
statistical mechanics and quantum mechanics becomes explicit. The difference is that in quantum mechanics we deal 
with a probability matrix whereas in mechanical statistics we deal with a probability function. 

== [8.3] Stationary States (the "Energy Basis") 

We can find the cigenstatcs |n) and the eigenvalues En of a Hamiltonian by diagonalizing it. 

n\n)^En\n) (216) 
U\n) =e-'^"*|n) 

Using Dirac notation: 

{n\U\m) = (S„™e-'^"* (217) 
If we prepare a state that is a superposition of basis states: 

n 

we get after time t 

|^(t))=Ee-^''"Vn|n) (219) 



== [8.4] Rate of change of the expectation value 

For any operator A we define an operator B: 
r)A 

B^i[H,A] + ^ (220) 
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such that 

d{A) 



(B) (221) 



dt 

proof: From the expectation value formula: 

=trace(ip(t)) (222) 

We get 

= trace(|^p(i))+trace(i^) (223) 
dA 

= trace(— p(i)) -ztrace(A[7^,p(t)]) 
ot 

OA 

= Uace{—p{t)) + ttrace{[n,A]p{t)) 
ot 

8 A 

= { — )+^{[n,A]) 

Where we have used Liouville's equation and the cyclic property of the trace. Alternatively, if the state is pure we 
can write: 

(A), = imiMm) (224) 

and then we get 

= (|^|i|V.) + (^|A||v^) + (V'|^|^) (225) 

OA 

= i{^P\nA\i;) - i{i;\An\i;) + 
Where we have used the Schrodinger equation. 

We would like to highlight the distinction between a full derivative and a partial derivative. Let's assume that there 
is an operator that perhaps represents a field that depends on the time t: 

A = x^ + tx^ (226) 

Then the partial derivative with respect to t is: 

dA ^ , , 

- = x^ (227) 

While the total derivative of {A) takes into account the change in the quantum state too. 



^^^^= [8.5] Definition of generalized forces 

We would like to know how the system's energy changes when we change one of the parameters (X) that the 
Hamiltonian [Ti.) depends on. We define the generalized force T as 
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We remember that the rate of change formula for an operator A is: 
In particular, the rate of change of the energy is: 

If E{0) is the energy at time < = we can calculate the energy E{t) at a later time, and the work W: 

W = -{E{t)~E{0)) = J {T)Xdt = j {T)dX (231) 

A " Newtonian force" is associated with the displacement of a piston. A generalized force called " pressure" is associated 
with the change of the volume of a box. A generalized force called "polarization" is associated with the change in an 
electric field. A generalized force called "magnetization" is associated with the change in a magnetic field. 



[8.6] Definition of currents 



There are two ways to define "current" operators. The "probability current" is defined via the rate of change of the 
occupation operator (see discussion of the "continuity equation" in a later section). The "electrical current" is defined 
as the generalized force associated with the change in a magnetic flux, as explained below. 

Lets assume that at a moment t the fiux is $, and that at the moment t + dt the flux is <& + d^. The electromotive 
force (measured in volts) is according to Faraday's law: 

EMF = — - (232) 
dt ^ ' 

If the electrical current is / then the amount of charge that has been displaced is: 

dQ = Idt (233) 
Therefore the work which is done is: 



W = EMF xdQ = Id<P (234) 

This formula implies that the generalized force which is associated with the change of magnetic flux is in fact the 
electrical current. Note the analogy between flux and magnetic field, and hence between current and magnetization. 
In fact one can regard the current in the ring as the "magnetization" of a spinning charge. 



^^^^= [8.7] How do we know what the Hamiltonian is? 

We construct the Hamiltonian from "symmetry" considerations. In the next lecture our object will be to show that 
the Hamiltonian of a non-rclativistic particle is of the form: 

n^:^{p-Aix)f + Vix) (235) 

In this lecture we will discuss a simpler case: the Hamiltonian of a particle in a two-site system. We will make the 
following assumptions about the two-site dynamics: 
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• The system is symmetric with respect to reflection. 

• The particle can move from site to site. 



LTU 

These two assumptions determine the form of the Hamiltonian. In addition, we will see how "gauge" considerations 
can make the Hamiltonian simpler, without loss of generality. 

First note that because of gauge considerations, the Hamiltonian can only be determined up to a constant. 

n^n + eQl (236) 
Namely, if we add a constant to a Hamiltonian, then the evolution operator only changes by a global phase factor: 

U{t) e-**C"+"°^) = e-"=«*e-'*" (237) 

This global phase factor can be gauged away by means of time dependent gauge transformation. We shall discuss 
gauge transformations in the next sections. 

=== [8.8] The Hamiltonian of a two-site system 

It would seem that the most general Hamiltonian for a particle in a two-site system includes 4 parameters: 

^ - (c2^ (238) 

Because of the assumed reflection symmetry ei = £2 = e it seems that we are left with 3 parameters. But in fact there 
is only one physical parameter in this model. Thanks to gauge freedom we can define a new basis: 

|i) = |1) (239) 
1 2) = e"^|2) 

and we see that: 

(2|7^|i) = e-*"*(2|7^|l) = e-^'^ce"'' = c (240) 
Therefore we can set (j) — without loss of generality. Then the Hamiltonian can be written as: 

^=(oO + (co)=^l + -^ (241) 

We also can make a gauge transformation in time. This means that the basis is time t is identified as |1) = exp{—iet)\l) 
and |2) = exp(— zet)|2). Using this time dependent basis we can get rid of the constant e. In fact, on physical grounds, 
one cannot say whether the old or new basis is "really" time dependent. All we can say is that the new basis is time 
dependent relative to the old basis. This is just another example of the relativity principle. The bottom line is that 
without loss of generality we can set e = 0. 
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[8.9] The evolution of a two-site system 



The eigenstates of the Hamiltonian are the states which are symmetric or anti-symmetric with respect to reflection: 

1+) = + |2)) (242) 
H = ;^(|1)H2)) 

The Hamiltonian in the new basis is: 

n^{l = C.3 (243) 

Let us assume that we have prepared the particle in site number one: 

IV'*=°>-|l> = ^(l+> + |-» (244) 

The state of the particle, after time t will be: 

|V*> = ^(e-*^*|+)+e-*(-^)*|-)) =cos(ct)|l) -isin(ct)|2) (245) 
v2 

We see that a particle in a two-site system makes coherent oscillations between the two sites. That is in contrast with 
classical stochastic evolution where the probability to be in each site (if we wait long enough) would become equal. In 
the future we will see that the ability to pass from site to site is characterized by a parameter called " inertial mass" . 
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[9] The non-relativistic Hamiltonian 



[9.1] N Site system in the continuum Limit 



In the last lesson we found the Hamiltonian 7Y in a two-site system by using gauge and symmetry considerations. 
Now we will generalize the result for an iV-site system. We will give each site a number. The distance between two 
adjacent sites is a. The basic assumption is that the particle can move from site to site. The generator of the particle's 
movement is Ti.. 



I2> 



r n 



r 



UiJ{dt)=6^J-idm^j (246) 

The Hamiltonian should reflect the possibility that the particle will either stay in its place or move one step right or 
left. Say that = 4. Taking into account that it should be Hermitian it has to be of the form 



'H^j 



/V 


c* 





c 


c 


V 


c* 








c 


V 


c* 


\c* 





c 


V 



(247) 



For a moment we assume that all the diagonal elements ("on sites energies") are the same, and that also all the 
hopping amplitudes are the same. Thus for general TV we can write 

n = cD + c*D-^ + Const = cq-'^p + c*e"P + Const (248) 
We define c = coe*"^, where cq is real, and get: 

n = coe-'('^j^-*) + coC^^^P-^) + Const (249) 
We define A = (j)/a (phase per unit distance) and get: 

n = coe-'''(P^^) + coe"(P-^) + Const (250) 
By using the identity e" sa 1 + za; — (l/2)a:;^ we get: 

n^-L(^p-A)' + V (251) 

Where we have defined l/(2m) = —cqo^ and V = 2co + Const. Now fi has three constants: m,A, V. If we assume 
that the space is homogenous then the constants are the same all over space. But, in general, it does not have to be 
so, therefore: 

T< = ir^ip-M^))^+V{x) (252) 
Zm[x) 

In this situation we say that there is a field in space. Such a general Hamiltonian could perhaps describe an electron 
in a metal. At this stage we will only discuss a particle whose mass m is the same all over space. This follows if 
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we require the Hamiltonian to be invariant under Galilei transformations. The Galilei group includes translations, 
rotations and boosts (boost = one system moves at a constant velocity relative to another system). The relativistic 
version of the Galilei group is the Lorentz group (not included in the syllabus of this course). In addition, we expect 
the Hamiltonian to be invariant under gauge transformations. This completes our basic requirement for invariance. 

^^^^= [9.2] The Hamiltonian of a Particle in 3-D Space 



In analogy to what wc did in one dimension, we write: 

U ^ cD.:, + cD^^ + cDy + cDy^ + cD^ + c D^^ = (253) 
= ce-'"'^- + c*e"'P^ + ce-"^^« + 0*6"^^" + ce""*^- + c'e^'^P' 

After expanding to second order and allowing space dependence we get: 

n = 2^(p -kf + V= 2^(p - A(f))2 + T/(x) (254) 

= 7r-(Px - A^{x,y,z)f + -^{py - Ay{x,y,z)f + -^{pz ~ Az{x,y,z)f + V{x,y,z) 
zm zm zm 



== [9.3] Geometric phase and dynamical phase 

Consider the case where there is no hopping between sites (cq = 0), hence the Hamiltonian Ti. docs not include a 
kinetic part: 

n = V{x) (255) 
U{t) = c-'*^(*) 

The particle does not move in space. V is the "dynamical phase" that the particle accumulates per unit time. V in a 
specific site is called "binding energy" or "on site energy" or "potential energy" depending on the physical context. 
A V that changes from site to site refiects the nonhomogeneity of the space or the presence of an "external field". If 
the system were homogeneous, we would expect to find no difference between the sites. 

Once we assume that the particle can move from site to site we have a hopping amplitude which wc write as c = coe*"^. 
It includes both the geometric phase (j) a-^d the "inertial" parameter cq, which tells us how "difficult" it is for the 
particle to move from site to site. More precisely, in the Hamiltonian matrix we have on the main diagonal the 
"spatial potential" Vi, whereas on the other diagonals we have the hopping amplitudes Ci-fje^"^'^^ . If the space is 
not homogeneous, the hopping coefficients do not have to be identical. For example |c2^3| can be different from 
|ci_,2|. Irrespective of that, as the particle moves from site i to site j it accumalates a geometric phase (pi-^j. By 
definition the vector potential A is the "geometric phase" that the particle accumulates per unit distance. Hence 
(jji^j = A - {vj -Vi). 



^^^^= [9.4] Invariance of the Hamiltonian 

The definition of "invariance" is as follows: Given that Ti = h{x,p; V, A) is the Hamiltonian of a system in the labora- 
tory reference frame, there exist V and A such that the Hamiltonian in the "new" reference frame isH — h{x,p; V, A). 
The most general Hamiltonian that is invariant under translations, rotations and boosts is: 



n = h{x,p; V,A)^—{p- A{x)f + V{x) 
zm 



(256) 
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Let us demonstrate the invariance of the Hamiltonian under translations: in the original basis |a;) wc have the fields 
V{x) and A{x). In the translated reference frame the Hamiltonian looks the same, but with V{x) = V{x + a) and 
A{x) = A{x + a). We say that the Hamiltonian is "invariant" (keeps its form). In order to make sure that we have not 
"mixed up" the signs, we will assume for a moment that the potential is V{x) = S{x). If we make a translation with 
a = 7, then the basis in the new reference frame will be \x) = |x + 7), and wc would get V{x) = V{x + a) = S{x + 7) 
which means a delta at x = — 7. 

== [9.5] Invariance under Gauge Transformation 



Let us define a new basis: 

= e-'^M^^i) (257) 
\i2) =e-'^'\x2) 

and in general: 

|x) = e-*^(")|x) (258) 
The hopping amplitudes in the new basis are: 

£i^2 = {i2\n\ii) = e'^^^-^'\x2\H\xi) = e'^^^-^^'ci^a (259) 
We can rewrite this as: 

4>i^2 = (pi-.2 + (A2 - Ai) (260) 
Dividing by the size of the step and taking the continuum limit we get: 

A{x) = A{x) + -^A{x) (261) 
dx 

Or, in three dimensions: 

A{x) = A{x) +\/A{x) (262) 

So we see that the Hamiltonian is invariant (keeps its form) under gauge. As we have said, there is also invariance 
for all the Galilei transformations (notably boosts). This means that it is possible to find transformation laws that 
connect the fields in the "new" reference frame with the fields in the "laboratory" reference frame. 

== [9.6] Is it possible to simplify the Hamiltonian further? 

Is it possible to find a gauge transformation of the basis so that A will disappear? We have seen that for a two-site 
system the answer is yes: by choosing A(a;) correctly, wc can eliminate A and simplify the Hamiltonian. On the other 
hand, if there is more than one route that connects two points, the answer becomes no (in other words, for systems 
with three sites or more). The reason is that in every gauge wc may choose, the following expression will always be 
gauge invariant: 

f A ■ dl = f A ■ dl = gauge invariant (263) 



In other words: it is possible to change each of the phases separately, but the sum of phases along a closed loop will 
always stay the same. Wc shall demonstrate this with a three-site system: 
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|i> = 








|2) = 








|3) = 










= 01^2 4 


-(A2- 


Ai) 


(t>2->3 


= 02^3 4 


- (A3 - 


A2) 


03^1 


= 03^1 4 


-(Ai - 


A3) 


01-»2 


+ 02-»3 4 


<^3-»l 


-01 



(264) 



^2^3 



^3^1 



If the system had three sites but with an open topology, then we could have gotten rid of A like in the two-site system. 
That is also generally true of all the one dimensional problems, if the boundary conditions are "zero" at infinity. Once 
the one-dimensional topology is closed ("ring" boundary conditions) such a gauge transformation cannot be made. 
On the other hand, when the motion is in two or three dimensional space, there is always more than one route that 
connects any two points, without regard to the boundary conditions, so in general one cannot eliminate A. 



[9.7] The classical equations of motion 



If X is the location of a particle, then its rate of change is called velocity. By the rate of change formula we identify v 
as 



v^i[H,x]^t[^ip-Aix)f,x] = -ip-A{x)) 
Zm m 



and wc have: 

The rate of change of the velocity v is called acceleration: 

'^^^^ = (a) 

"1 



dt ' ' 
d = i\H, v] 



(265) 



(266) 



(267) 



dv 1 



dt m 
Where we have defined: 



-{vxB~Bxv) + £ 



6 = V X ^ 
OA 
^~dt " 



£ 



(268) 



We would like to emphasize that the Hamiltonian is the "generator" of the evolution of the system, and therefore all 
the equations of motion can be derived from it. From the above it follows that in case of a "minimal" wavepackct the 
expectation values of x and v and d obey the classical equations approximately. 
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In the expression for the acceleration we have two terms: the "electric" force and the "magnetic" (Lorentz) force. 
These forces bend the trajectory of the particle. It is important to realize that the "bending" of trajectories has to 
do with interference and has a very intuitive heuristic explanation. This heuristic explanation is due to Huygens: We 
should regard each front of the propagating beam as a point-like source of waves. The 'next front (after time dt) is 
determined by interference of waves that come from all the points of the previous front. For presentation purpose 
it is easier to consider first the interference of = 2 points, then to generalize to N points, and then to take the 
continuum limit of plain front. The case TV = 2 is formally equivalent to a two slit experiment. The main peak of 
constructive interference is in the forward direction. We want to explain why a non uniform V{x) or the presence of 
magnetic field can shift the main peak. A straightforward generalization of the argument explains why a trajectory 
of a plane wave is bent. 

Consider the interference of partial waves that originate from two points on the front of a plane wave. In the absence 
of external field there is a constructive interference in the forward direction. However if V{x) in the vicinity of one 
point is smaller, it is like having larger "index of refraction". As a result the phase of tpi^) grow more rapidly, and 
consequently the constructive interference peak is shifted. We can summarize by saying that the trajectory is bending 
due to the gradient in V{x). A similar effect happens if the interfering partial waves enclose an area with a magnetic 
field. We further discuss this interference under the headline "The Aharonov Bohm effect" : It is important to realize 
that the deflection is due to an interference effect. Unlike the classical point of view it is not B{x) that matters but 
rather A{x), which describes the geometric accumulation of the phase along the interfering rays. 

== [9.8] The continuity Equation (conservation of probability) 



The Schrodinger equation is traditionally written as follows: 

n = n{x,p) (269) 
d\^) 



dt 



5* n,f ^ , X 

— = -in x,-i— * 



dt \ ' dx 



±{-^V-A{x)r+Vix) 



^(x) 



dt 

Using the "rate of change formula" for the probability density we can obtain a continuity equation: 

^ = -V.J(x) (270) 

Where the probability density and probability current are: 

p{x) ^ \^{x)\' (271) 

J{x) = i?e[«'*(.T)-(-iV - A(.t))*(.t)] (272) 
m 

The procedure to get this result is best illustrated once applied to an A^-site system (see appropriate " QM in practice" 
section). One should realize that both the probability density and the probability current are expectation values: 

p{x) = {^flpi^m (273) 

j{x) = {^iH'^m 

where we have defined the operators: 

p{x) = S{x-x) (274) 
J{x) = —{v6{x — x)+6{x — x)v)) 



[10] Symmetries and their implications 



48 



== [10.1] The Concept of Symmetry 

Pedagogical remark: In order to motivate and to clarify the abstract discussion in this section it is recommended to 
consider the problem of finding the Landau levels in Hall geometry, where the system is invariant to x translations and 
hence is a constant of motion. Later the ideas are extended to discuss motion in centrally symmetrical potentials. 

We emphasize that symmetry and invariance are two different concepts. Invariance means that the laws of physics 
and hence the form of the Hamiltonian do not change. But the fields in the Hamiltonian may change. In contrast 
to that in case of a symmetry we requite H = H, meaning that the fields look the literally same. As an example 
consider a particle that moves in the periodic potential V{x;R) = cos(27r(a; — R)/L). The Hamiltonian is invariant 
under translations: If we make translation a then the new Hamiltonian will be the same but with R = R — a. But in 
the special case that R/ L is an integer we have symmetry, because then V{x\ R) stays the same. 

== [10.2] What is the meaning of commutativity? 

Let us assume for example that [}-L,Px] ~ 0. We say in such case that the Hamiltonian commutes with the generator 
of translations. What are the implication of this statement? The answer is that in such case: 

• The Hamiltonian is symmetric under translations 

• The Hamiltonian is block diagonal in the momentum basis 

• The momentum is a constant of motion 

• There might be systematic degeneracies in the spectrum 

The second statement follows from the "separation of variables" theorem. The third statement follows from the 
expectation value rate of change formula: 

^ = m,px]) = (275) 

For time independent Hamiltonians E = (H) is a constant of the motion because [H,Ti.] = 0. Thus {H) = const 
is associated with symmetry with respect to "translations" in time, while (p) = const is associated with symmetry 
with respect to translations in space, and (L) = const is associated with symmetry with respect to rotations. In the 
foUwing two subsection we further dwell on the first and last statements in the above list. 

== [10.3] Symmetry under translations and rotations 

If [H,Px] = then for every translation a: 

[n,D{a)] ^ HD-DH = (276) 
D-^HD = n 

If we change to a translated frame of reference, then we have a new basis which is defined as follows: 

\x) = \x + a) = D\x) (277) 
This means that the transformation matrix is T = D{a), and that the following symmetry is fulfilled: 

= T-^UT = n (278) 
We say that the Hamiltonian is symmetric under translations. This can be summarized as follows: 



\H, D{d)] — 0, for any a 



(279) 
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is equivalent to 



[H,pi] = for i = x,y,z 



(280) 



An analogous statement holds for rotations: Instead of writing: 



[n, R{S)] = 0, for any $ 



(281) 



We can write: 



[H, Li] ~ for i = x,y,z 



(282) 



If this holds it means that the Hamiltonian is symmetric under rotations. 



[10.4] Symmetry implied degeneracies 



Let us assume that 7i is symmetric under translations D. Then if \^p) is an eigenstate of 7i then also \ip) = D\^p) is 
an eigenstate with the same eigenvalue. This is because 



Now there are two possibilities. One possibility is that j?/;) is an eigenstate of D, and hence \(p) is the same state as 
\ip)- In such case we say that the symmetry of \ip) is the same as of 7i, and a degeneracy is not implied. The other 
possibility is that \ip) has lower symmetry compared with Ti.. Then it is implied that \^) and \ip) span a subspace of 
degenerate states. If we have two symmetry operations A and B, then we might suspect that some eigenstates would 
have both symmetries: that means both A\ip) cx and B\tp) oc j?/;). If both symmetries hold for all the eigenstates, 
then it follows that [A, B] = 0, because both are diagonal in the same basis. 

In order to argue symmetry implies degeneracies the Hamiltonian should commute with a non-commutative group of 
operators. It is simplest to explain this statement by considering an example. Let us consider particle on a clean ring. 
The Hamiltonian has symmetry under translations (generated by p) and also under reflections (i?) . We can take the 
kn states as a basis. They are eigenstates of the Hamiltonian, and they are also eigenstates of p. The ground state 
n = has the same symmetries as that of the Hamiltonian and therefore there is no implied degeneracy. But |fc„) 
with n ^ has lower symmetry compared with 7i, and therefore there is an implied degeneracy with its mirror image 
|fc_„). These degeneracies are unavoidable. If all the states were non-degenerated it would imply that both p and R 
are diagonal in the same basis. This cannot be the case because the group of translations together with reflection is 
non-commutative. 

The dimension of a degenerate subspace must be equal to the dimension of a representation of the symmetry group. 
This is implied by the following argument: One can regard 7i as a mutual constant of motion for all the group operators; 
therefore, by the "separation of variables theorem" , it induces a block-decomposition of the group representation. The 
dimensions of the blocks are the dimensions of the degenerate subspaces, and at the same time they must be compatible 
with the dimensions of the irreducible representations of the group. 

Above we were discussing only the systematic degeneracies which are implied by the symmetry group of the Hamil- 
tonian. In principle we can have also "accidental" degeneracies which are not implied by symmetries. The way to 
"cook" such symmetry is as follows: pick two neighboring levels, and change some parameters in the Hamiltonian so 
as to make them degenerate. It can be easily argued that in general we have to adjust 3 parameters in order to cook 
a degeneracy. If the system has time reversal symmetry, then the Hamiltonian can be represented by a real matrix. 
In such case it is enough to adjust 2 parameters in order to cook a degeneracy. 



n\ip) = nD\ij) = Dn\tij) = E\if) 



(283) 



Fundamentals (part III) 
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[11] Group representation theory 



= [11.1] Groups 

A group is a set of elements with a binary operation: 

• The operation is defined by a muhiphcation table for * r^. 

• There is a unique identity element 1. 

• Every element has an inverse element so that tt^^ = 1 

• Associativity: * (r^ * t^) = {t^ * t^) * 

Commutativity does not have to be fulfilled: this means that in general * t"^ ^ t"^ * . 

The Galilei group is our main interst. It includes translations, rotations, and boosts. A translation is specified 
uniquely by three parameters (ai, 02, 03), or for short a. Rotations arc specified by (6, ip, $), or for short A boost 
is parametrized by the relative velocity 112,113). A general element is any translation, rotation, boost, or any 
combination of them. Such a group, that has a general element that can be defined using a set of parameters is called 
a Lie group. The Galilei group is a Lie group with 9 parameters. The rotation group (without reflections!) is a Lie 
group with 3 parameters. 



[11.2] Realization of a Group 



If there are H elements in a group, then the number of rows in the fuU muhiphcation table wiU be (H^)^ = H^^. The 
multiplication table is too big for us to construct and use. Instead we will have to make a realization: wc must realize 
the elements of the group using transformations over some space. The realization that defines the Galilei group is 
over the six dimensional phase space (x, v). The realization of a translation is 

^ ^ ^ (284) 
V = v ^ ' 

The realization of a boost is 

:J + u (285) 

and the realization of a rotation is 
'i = i?^($)x 



v = i?^'($)v (2^^) 

A translation by b, and afterward a translation by a, gives a translation by Tb+a. This is simple. More generally the 
"multiplication" of group elements r'^ = * is realized using a very complicated function: 

(a3,*3^u3) ^ /(a2,*^u2,a\*\ui) (287) 

We notice that this function receives input that includes 18 parameters and gives output that includes 9 parameters. 



== [11-3] Realization using linear transformations 

As mentioned above, a realization means that wc regard each element of the group as an operation over a space. We 
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treat the elements as transformations. Below we will discuss the possibility of finding a realization which consists of 
linear transformations. 

First we will discuss the concept of linear transformation, in order to clarify it. As an example, we will check whether 
f{x) = x + 5 is a linear function. A linear function must fulfill the condition: 

/(aXi + (3X2) = af{Xi) + l3f{X2) (288) 

Checking f{x): 

/(3) = 8,/(5) = 10,/(8) = 13 (289) 
/(3 + 5)^/(3) + /(5) 

Hence we realize that f{x) is not linear. 

In case of the defining realization of the Galilei group over the phase space, rotations are linear transformations, but 
translations and boosts are not. If we want to realize the Galilei group using linear transformations, the most natural 
way would be to define a realization over the function space. For example, the translation of a function is defined as: 

Ta : *(x) = *(x-a) (290) 

The translation of a function is a linear operation. In other words, if we translate a'^i{x) + (3^1 2{x)^ we get the 
appropriate linear combination of the translated functions: a^ii{x — a) + /3\['2(x — a). 

Linear transformations are represented by matrices. That leads us to the concept of a "representation". 

^^^= [11.4] Representation of a group using matrices 

A representation is a realization of the elements of a group using matrices. For every element r of the group, we 
find an appropriate matrix U{t). We demand that the "multiplication table" for the matrices will be one-to-one to 
the multiplication table of the elements of the group. Below we will "soften" this demand and be satisfied with the 
requirement that the "multiplication table" will be the same "up to a phase factor". In other words, if = t'^ *t^, 
then the appropriate matrices must fulfill: 

C/(t3) = c*(phase)f^(^2)y(^i) (291) 

It is natural to realize the group elements using orthogonal transformations (over a real space) or unitary transforma- 
tions (over a complex space) . Any realization using linear transformation is automatically a " representation" . The 
reason for this is that linear transformations are always represented by matrices. For example, we may consider the 
realization of translations over the function space. Any function can be written as a combination of delta functions: 



^{x) = J ^{x')S{x-x')dx (292) 

In Dirac notation this can be written as: 

\^)=J2'i'M (293) 

X 

In this basis, each translation is represented by a matrix: 



= {x\D{a)\x') = 5ix~{x' + a)) 



(294) 
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Finding a "representation" for a group is very convenient, since the operative meaning of "multiplying group elements" 
becomes "multiplying matrices". This means that we can deal with groups using linear algebra tools. 

== [11.5] Commutativity of translations and boosts? 



If we make a translation and afterward a boost, then we get the same transformation as we would get if we made the 
boost before the translation. Therefore, boosts and translations commute. It is not possible to find a representation 
over function space that shows this commutativity. Therefore, we will have to "soften" the definition of "representa- 
tion" and demand that the multiplication table will be correct "up to a phase factor". Therefore, from now on we 
will assume that translations and boosts do not commute! 

Proving the last conjecture is very intuitive if we use the physics language. Let's assume that boosts do commute 
with translations. Say we have a particle in the laboratory reference frame that is described by a wave function that 
is an eigenstate of the translation operators. In other words, we are talking about a state with a defined momentum 
k. We will move to a moving reference frame. It is easy to prove that if '^{x) is an eigenstate with a specific fc, then 
^'(a;) is an eigenstate in the moving reference frame, with the same k. This follows from our assumption that boosts 
and translations commute. From this we come to the absurd conclusion that the particle has the same momentum in 
all the reference frames... If we don't want to use a trivial representation over the function space, we have to assume 
that boosts and translations do not commute. 



[11.6] Generators 



Every element in a Lie group is marked by a set of parameters: 3 parameters for the group of rotations, and 
9 parameters for the Galilei group which includes also translations and boosts. Below we assume that we have a 
"unitary representation" of the group. That means that there is a mapping 

T^U{Tl,T2,...,Tf,,...) (295) 

We will also use the convention: 

1 1-^ f7(0,0, ...,0, ...) = i = identity matrix (296) 
We define a set of generators in the following way: 

f/(0,0,...,,5T^,0,...) = i-i5r^,G^ = e''^^-^- (297) 
(there is no summation here) For example: 

[/(<5ri,0,0,...) = i-idTiGi = e-'^^''^' (298) 

The number of basic generators is the same as the number of parameters that marks the elements of the group (3 
generators for the rotation group). In the case of the Galieli we have 9 generators, but since we allow arbitarary phase 
factor in the multiplaication table, we have in fact 10 generators: 



Px 7 Py ^ Pz ^ Jx : Jy ^ Jz : Q X : Qy ^ Q z : and 1. (299) 

The generators of the boosts, when using a representation over the function space, are Qx ~ — ma;, etc., where m is 
the mass. It is physically intuitive, since we may conclude from the commutation relation [x,p\ = i that —x is the 
generator of translations in p. 
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[11.7] How to use generators 



In general a transformation which is generated by A would not commute with a transformation which is generated 
hyB, 



But if the generated transformations are infinitesimal then: 

e^^e'-^ = e^-^e<^^ = i + ei + + ©(e^) (301) 

We can use this in order to show that any transformation U (r) can be generated using the complete set of generators 
that has been defined in the previous section. This means that it is enough to know what are the generators in order 
to calculate all the matrices of a given representation. The calculation goes as follows: 

U(t) = ([/(<5r))^ (302) 

= (C/(<5Tl)t/(<5T2)...)^ = 

= (c~''''^i'^ie~''''^^^^ )^ = 

^~i&T\G\ — i&TiG^...\N 

= (e-»«-G)JV = g-.r.G 

The next issue is how to multiply transformations. For this we have to learn about the algebra of the generators. 



[11.8] Combining generators 



It should be clear that if A and B generate (say) rotations, it does not imply that (say) the hermitian operator 
AB + BA is a generator of a rotation. On the other hand we have the following important statement: if A and B are 
generators of group elements, then also G = a A + (3B and G = i[A, B] are generators of group elements. 

Proof: by definition G is a generator if e^"^*^ is a matrix that represents an element in the group. We will prove the 
statement by showing that the infinitesimal transformation e~^'^^ can be written as a multiplication of matrices that 
represent elements in the group. In the first case: 

In the second case we use the identity: 

^elA.B] ^ ^-i^-eB ^-^^eA ^^^eB ^^^eA _^ ^^^2^ 

This identity can be proved as follows: 

1 + e{AB - BA) = (1 - iV^B - ^eS^)(l - ^V^A - ^e^^)(l + iV^B - ^eS^)(l + - ^eA^) (305) 
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== [11'9] Structure constants 

Any element in the group can be written using the set of basic generators: 

U{t) = e-'^-^ (306) 

From the previous section it foUows that i[G^,G^] is a generator. Therefore, it must be a hnear combination of the 
basic generators. In other words, there exit constants c^^ such that the foUowing cfosure relation is fulfilled: 

[G^,G,]^tY,4.Gx (307) 

A 

The constants c^^ are called the "structure constants" of the group. Every "Lie group" has its own structure 
coefficients. If we know the structure coefficients then we can reconstruct the group's "multiplication table". Below 
we will find the structure coefficients of the rotation group, and in the following lectures we will learn how to build 
all the other representations of the rotation group, from our knowledge of the structure coefficients. 

^^^^= [11.10] The structure constants and the mult ipUcat ion table 

In order to find the group's multiplication table from our knowledge of the generators, we must use the formula: 

e^e^ = e^+^+^ (308) 

Where C is an expression that includes only commutators. There is no simple expression for C. However, it is possible 
to find ("per request") an explicit expression up to any accuracy wanted. By Taylor expansion up to the third order 
we get: 

C = log(e^e^) - A - B = ^[A, B] - ^[[A, B],iA - B)] + ... (309) 

From this we conclude that: 

e-^^^-c^-^f^-C! = e-^i-G (310) 

Where: 

7A = aA + /3a + \cl^a^l3, - ^c^^^c;^^{a - /3).a^/5. + ... (311) 



For more details see paper by Wilcox (1967) available in the course site. 
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[12] The group of rotations 



= [12.1] The rotation group SO(3) 

The rotation group 50(3) is a non-commutative group. That means that the order of rotations is important. Despite 
this, it is important to remember that infinitesimal rotations commute. We have already proved this statement in 
general, but we will prove it once again for the specific case of rotations: 



i?((5*)r 



(5* X r 



So: 



(312) 



i?(5*^)i?(5*^)r = (r + (5*1 X r) + 5*^ x (r + 6^^ x r) = (313) 
= r + ((5*1 + (5*2) X r = 
= E(5*i)i?((5*2)r 

Obviously, this is not correct when the rotations arc not infinitesimal: 

i?($i)i?(<i>2) ^ i?(<i)i + $2) ^ i?(*2)_R($i) (314) 

We can construct any infinitesimal rotation from small rotations around the major axes: 

i?(<5$) = i?((5$,e, + S<Pyey + S<P,e,) = R{S<i>^e^)R{5%ey)R{S^,e,) (315) 

If we mark the generators by M — {Mx, My, Mz), then we conclude that a finite rotation around any axis can be 
written as: 

i?($n) = R{i) = i?((5$)^ = {R{S<i>x)R{S^y)R{S^z))^ = ^^-^S$■M^N ^ ^^^$■M ^ ^^^^S.M^ (g^g) 

We have proved that the matrix M„ = n • M is the generator of the rotations around the axis n. 



== [12.2] Structure constants of the rotation group 

We would like to find the structure constants of the rotation group 50(3), using its defining representation. The 5*0(3) 
matrices induce rotations without performing reflections and all their elements are real. The matrix representation of 
a rotation around the z axis is: 



'^cos($) - sin(<I>) 0^ 
R{<^ez) = I sin($) cos($) 











1, 



(317) 



For a small rotation: 



i?(<5$e.) 





-5$ 





(5$ 


1 














l + (5$ 



^0 -1 0> 
1 
,0 0> 



1 - iS^Mz 



(318) 



Where: 



^0 -i 0> 
Mz = \i 
,0 Oj 



(319) 
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We can find the other generators in the same way: 
^0 

Mx = I I (320) 
,0 i 





0; 



My = I 



Or. written compactly: 

{Mk)^J = ~ie^Jk (321) 

We have found the 3 generators of rotations. Now we can calculate the structure constants. For example 
[Mx, My] = iMz, and generally: 

[M„ M,] = zeyfcMfe (322) 



12.3] Motivation for finding dim=2 representation 



We defined the rotation group by the Euclidean realization over 3D space. Obviously, this representation can be used 
to make calculations ("to multiply rotations"). The advantage is that it is intuitive, and there is no need for complex 
numbers. The disadvantage is that they are 3x3 matrices with inconvenient algebraic properties, so a calculation 
could take hours. It would be convenient if we could "multiply rotations" with simple 2x2 matrices. In other words, 
we are interested in a dim=2 representation of the rotation group. The mission is to find three simple 2x2 matrices 
that fulfill: 

[Jx,Jy]=iJz etc. (323) 

In the next lecture we will learn a systematic approach to building all the representations of the rotation group. In 
the present lecture, we will simply find the requested representation by guessing. It is easy to verify that the matrices 



Sx = ^(Jx, Sy = ^(Ty, Sz = ^(Tz (324) 



fulfill the above commutation relations. So, we can use them to create a dim=2 representation of the rotation group. 
We construct the rotation matrices using the formula: 

R = e"'*-^ (325) 
The matrices that we get will necessarily fulfill the right multiplication table. 

We should remember the distinction between a realization and a representation: in a realization it matters what we 
are rotating. In a representation it only matters to us that the correct multiplication table is fulfilled. Is it possible 
to regard any representation as a realization? Is it possible to say what the rotation matrices rotate? When there is a 
dim=3 Euclidean rotation matrix we can use it on real vectors that represent points in space. If the matrix operates 
on complex vectors, then we must look for another interpretation for the vectors. This will lead us to the definition 
of the concept of spin (spin 1). When we are talking about a dim=2 representation it is possible to give the vectors 
an interpretation. The interpretation will be another type of spin (spin 1/2). 
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== [12.4] How to calculate a general rotation matrix 

The general formula for constructing a 3 x 3 rotation matrix is: 

i?(<i>) = = e"**^^" = 1 - (1 -cos($))M2-isin($)M„ (326) 

where Mn = n ■ M is the generator of a rotation around the n axis. All rotations are "similar" one to the other 
(moving to another reference frame is done by means of a similarity transformation that represents change of basis). 
The proof is based on the Taylor expansion. We notice that = M^, from this it follows that for all the odd powers 
= M^, while for aU the even powers = Ml where fc > 0. 

The general formula for finding a 2 x 2 rotation matrix is derived in a similar manner. All the even powers of a given 
Pauli matrix are equal to the identity matrix, while all the odd powers are equal to the original matrix. From this 
(using Taylor expansion and separating into two partial sums), we get the result: 

i?($) = i?(3>n) = e"**^" = cos($/2)i - i sin($/2)(7„ (327) 

where an = n ■ a, and Sn = (l/2)o'„ is the generator of a rotation around the n axis. 



^^^= [12.5] An example for multiplication of rotations 

Let us make a 90° rotation R{9Q'^ez) around the Z axis, followed by a 90° rotation i?(90°ej,) around the Y axis. We 
would like to know what this sequence gives. Using the Euclidean representation 

i? = 1 - i sin $A/„ - (1 - cos $)Af,^ (328) 

we get 

J?(90°e^) = 1 - iM, - M| (329) 
i?(90°ey) = 1 - iMy - 

We do not wish to open the parentheses, and add up 9 terms which include multiplications of 3 x 3 matrices. Therefore, 
we will leave the Euclidean representation and try and do the same thing with a dim=2 representation, which means 
we will work with the 2x2 Pauli matrices. 

i?($) = cos($/2)i - i sin($/2)cr„ (330) 
i?(90°e,) = 

V2 

i?(90%) = -^(i-za,) 

Hence 

R ^ R{90°ey)R{90°ez) = ^(1 - ia.^ - ioy - ia^) (331) 

Where we have used the fact that o-ycr^ = iax- We can write this result as: 

120° 120°^ ^ , , 

it = cos ism ^ n - a (332) 

Where n = "^(1' I7 !)• This defines the equivalent rotation which is obtained by combining the two 90° rotations. 
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= [12.6] Euler Angles 

We can prove the following identity in the same way: 

i?(90°e^) = i?(-90°e^)i?(90"ey)i?(90°e2) (333) 

This identity is actually trivial: a rotation round the X axis is the same as a rotation round the Y axis, if we change 
to a different reference frame. 

Alternatively, we can look at the above as a special case of a "Euler rotation". Eulcr showed that any rotation can 
be assembled from rotations round the Y axis and rotations round the Z axis: 

R = e-i°'J'e-'PJye-i'iJ' (334) 



The validity of the idea is obvious, but finding the Euler angles can be complicated. 



[13] Building the representations of rotations 
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== [13.1] Irreducible representations 

A reducible representation is a representation for which a basis can be found in which each matrix in the group 
decomposes into blocks. In other words, each matrix can be separated into sub-matrices. Each set of sub-matrices 
must fulfill the multiplication table of the group. To decompose a representation means that we change to a basis in 
which all the matrices decompose into blocks. Only in a commutative group a basis can be found in which all the 
matrices arc diagonal. So, wc can say that a representation of a commutative group decomposes into one-dimensional 
representations. The rotation group is not a commutative group. Wc arc interested in finding all the irreducible 
representations of the rotation group. All the other representations of the rotation group can be found in a trivial 
way by combining irreducible representations. 

We are interested in finding representations of the rotation group. We will assume that someone has given us a "gift": 
a specific representation of the rotation group. We want to make sure that this is indeed a "good gift". We will check 
if the representation is reducible. Maybe we have "won the lottery" and had received more than one representation? 
Without loss of generality, we will assume that we only had received one (irreducible) representation. We will try 
to discover what are the matrices that we had received. We will see that it is enough to know the dimension of 
the representation in order to determine what are the matrices. In this way we will convince ourselves that there is 
only one (irreducible) representation for each dimension, and that we have indeed found all the representations of the 
rotation group. 



== [13.2] First Stage - determination of basis 

If we had received a representation of the rotation group then we can look at infinitesimal rotations and define 
generators. For a small rotation around the X axis, we can write: 

U{5^e^) = i - i5J^ (335) 

In the same way we can write rotations round the Z and Y axes. So, we can find the matrices Jx, Jy, Jz- How can 
we check that the representation that we have is indeed a representation of the rotation group? All we have to do is 
check that the following equation is fulfilled: 

[Ji, Jj] = ieijkJk (336) 



We will also define: 



J± 



(337) 



1 



We notice that the operator commutes with all the generators, and therefore also with all the rotation matrices. 
From the "separation of variable" theorems it follows that if has (say) two different eigenvalues, then it induces a 
decomposition of all the rotation matrices into two blocks. So in such case the representation is reducible. Without 
loss of generality our interest is focused on irreducible representations for which we necessarily have = Al, where A 
is a constant. Later we shall argue that A is uniquely determined by the dimension of the irreducible representation. 

If we have received a representation as a gift, wc still have the freedom to decide in which basis to write it. Without 
loss of generality, we can decide on a basis that is determined by the operator J^: 



Jzjm) = m\m) 



(338) 
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Obviously, the other generators, or a general rotation matrix will not be diagonal in this basis, so we have 

{m\P\m') = \5ram' (339) 
(mlJ^lm') = mSjnm' 
{m\R\m') = Rl„^, 



13.3] Reminder: Ladder Operators 



Given an operator D (which does not have to be unitary or Hermitian) and an observable x that fulfill the commutation 
relation 

[x, b] = ab (340) 

we will prove that the operator D is an operator that changes (increments or decrements) eigenstates of x. 

xb~ bx^ ab (341) 
xb = b{x + a) 
xb\x) = b{x + a)\x) 
xb\x) = b{x + a)\x) 
x[b\x)] = {x + a)[b\x)] 

So the state \'i>) = b\x) is an eigenstate of x with eigenvalue {x + a). The normalization of j^*) is determined by: 

= (^-l^-) = (a;|i)tz)|a;) (342) 



== [13.4] Second stage: identification of ladder operators 

It follows from the commutation relations of the generators that: 

[J„J±] = ±J± (343) 

So J± are ladder operators in the basis that we are working in. By using them we can move from a given state |m) 
to other eigenstates: |to — 2), |m — 1), |m + 1), |m + 2), |m + 3), .... 

From the commutation relations of the generators 

( J+J_) - (J„J+) = [J+, J„] = 2J, (344) 
From the definition of 

(J+J„) + (J_ J+) = 2(J2 - (J,)2) (345) 
By adding/subtracting these two identities we get: 

LJ+^J^-X{1 + 1) (346) 
J+L = - - I) 
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Now we can find the normalization of tfie states that are found by using the ladder operators: 

||J+|m)|| = {m\J-J+\m) = {m\.P\m) - {m\J ^{J ^ + l)\m) = \ - m(m + I) (347) 
II J_|m)|| = (m| J_|_ J_|m) — (m| J^|m) — {m\Jz{Jz — 1)|to) = A — m(m — 1) 

It will be convenient from now on to write the eigenvalue of as A = i{j + I). Therefore: 



J+|m) = ^/j{j + l)-m{m + l)\m + I) (348) 
J_|m) = \/ iij + I) — m{m — l)\m — I) 



== [13.5] Third stage - deducing the representation 

Since the representation is of a finite dimension, the process of incrementing or decrementing cannot go on forever. 
By looking at the results of the last section we may conclude that there is only one way that the incrementing could 
stop: at some stage we get m = +j. Similarly, there is only one way that the decrementing could stop: at some stage 
we get m = —j. Hence in the incrementing/decrementing process we get a ladder that includes 2j + I states. This 
number must be an integer number. Therefore j must be either an integer or half integer number. 

For a given j the matrix representation of the generators is determined uniquely. This is based on the formulas of the 
previous section, from which we conclude: 



[J+\m'm = \/j(j + 1) - "T-("l + l)5m',ra+l 
[J-]m'm = v7CH-^T^-^^^(^^^^-^'^m%m-l 



(349) 



And all that is left to do is to write: 



[J, 



x\m' m 



I r 

2 
I 

'y\m'm — ^ 



ni'rn m 



(350) 



And then we get every rotation matrix in the representation by: 

-Rm'™ = e-'*-^^ (351) 



A technical note: In the raising/lowering process described above we got "multiplets" of m state. It is possible 
that we will get either one multiplet, or several multiplets of the same length. In other words it is possible that the 
representation will decompose into two identical representations of the same dimension. Without loss of generality 
we assume that we deal with an irreducible representation, and therefore there is only one multiplet. 
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[14] Rotations of spins and of wavefunctions 



[14.1] Building the dim=2 representation (spin 1/2) 



Let us find the j = 1/2 representation. This representation can be interpreted as a reahzation of spin 1/2. We 
therefore we use from now on the notation S instead on J. 

S'\m)^^i^ + l)\m) (352) 

Using formulas of the previous section we find 5'+ and S- and hence Sx and Sy 

= (0 J) (353) 

- ^ (? 2) (3-) 

= ^((n n) + (i n)) = (i l) (355) 

S'y = ( i n] = 7^<^y (356) 



^0 1^ 
















1 




7) 


= 2^. 





We recall that 

i?($) = i?($r?) = e"**^" = cos($/2)i-isin($/2)cr„ (357) 

where 

n = (sin cos </?, sin0sin(y9, coaO) 

'^" = '^-^=(,e»^sin0 -COS0 j (3^8) 

Hence 

^ /cos(cI>/2)-*cos(0)sin($/2) -ie"'^ sin(0) sin($/2) \ 

-ie"^sin(6l)sin($/2) cos(<I>/2) + i cos(6') sin($/2)y ^ > 

In particular a rotation around the Z axis is given by: 

i? = e-'*^^ = ('""f' (360) 



e^*/2^ 

And a rotation round the y axis is given by 



R_p-.$s„ _ /cos(<i>/2) -sin($/2) 
Uin($/2) cos($/2) 



(361) 
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== [14.2] Polarization states of Spin 1/2 

We now discuss the physical interpretation of the "states" that the s = 1/2 matrices rotate. Any state of "spin 1/2" is 
represented by a vector with two complex number. That means we have 4 parameters. After gauge and normalization, 
we are left with 2 physical parameters which can be associated with the polarization direction (0, if). Thus it makes 
sense to represent the state of spin 1/2 by an arrow that points to some direction in space. 

The eigenstates of Sz do not change when we rotate them around the Z axis (aside from a phase factor). Therefore 
the following interpretation comes to mind: 

'4) -ID-fj) (362) 



2/ ' ' VO 

This interpretation is confirmed by rotating the "up" state by 180 degrees, and getting the "down" state. 



We see that: 




(364) 



With two rotations of 180° we get back the "up" state, with a minus sign. Optionally one observes that 



and hence by similarity this holds for any 27r rotation. We see that the representation that we found is not a one-to- 
one representation of the rotation group. It does not obey the multiplication table in a one-to-one fashion! In fact, 
we have found a representation of SU{2) and not 50(3). The minus sign has a physical significance. In a two slit 
experiment it is possible to turn destructive interference into constructive interference by placing a magnetic field in 
one of the paths. The magnetic field rotates the spin of the electrons. If we induce 360° rotation, then the relative 
phase of the interference change sign, and hence constructive interference becomes destructive and vice versa. The 
relative phase is important! Therefore, we must not ignore the minus sign. 

It is important to emphasize that the physical degree of freedom that is called "spin 1/2" cannot be visualized as a 
arising from the spinning of small rigid body around some axis like a top. If it were possible, then we could say that 
the spin can be described by a wave function. In this case, if we would rotate it by 360° we would get the same state, 
with the same sign. But in the representation we are discussing we get minus the same state. That is in contradiction 
with the definition of a (wave) function as a single valued object. 

We can get from the "up" state all the other possible states merely by using the appropriate rotation matrix. In 
particular we can get any spin polarization state by combining a rotation round the Y axis and a rotation round the 
Z axis. The result is: 

\eo,,) = R{^)Rm t) = e--^^e-'^^« | T) - ["'J^l' Zfe/2)) ^^66) 
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== [14.3] Building the dim=3 representation (spin 1) 

Let us find the j = 1 representation. This representation can be interpreted as a realization of spin 1, and hence we 
use the notation S instead of J as in the previous section. 



^4 




(367) 



So, the standard representation is: 



S 




(368) 



We remember that the Euclidean representation is: 



M 




(369) 



Now we have two different dim=3 representations that represent the rotation group. They are actually the same 
representation in a different basis. By changing bases (diagonalizing Mz) it is possible to move from the Euclidean 
representation to the standard representation. It is obvious that diagonalizing Mz is only possible over the complex 
field. In the defining realization, the matrices of the Euclidean representation rotate points in the real space. But it 
is possible also to use them on complex vectors. In the latter case it is a realization for spin 1. 

For future use we list some useful matrices: 




Sy 




Sz 




(370) 



From this: 




1 




2 





-1 


1 




(371) 



And as expected from {£,m\S^\£' ,m') ~ £{£ + l)Se,i'Sm,m' we get 




(372) 



Having found the generators we can construct any rotation of spin 1. We notice the following equation: 

Sf = SfS^ = for i^x,y,z (373) 
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From this equation we conclude that all the odd powers (1, 3, 5, ...) are the same and are equal to Si, and all the even 
powers (2,4,6, ...) are the same and equal to Sf. It follows (by way of a Taylor expansion) that: 



[/($) 



-%<!>■ S 



i-isin($)5„-(l-cos($))5'2 



(374) 



Where: 



Sr, — n ■ S 



(375) 



Any rotation can be given by a combination of a rotation round the z axis and a rotation round the y axis. We will 
mark the rotation angle round the y axis by and the rotation angle round the z axis by ip, and get: 



U{9ey) = e-''*'^" 



'e-'^ 
1 

0**^, 



sm u 

yi(i - cos( 



(376) 



/i(l + cos6') -^sin6' i(l-cos( 



1 



sm ( 



i(l + cos( 



[14.4] Polarization states of a spin 1 



The states of "Spin 1" cannot be represented by simple arrows. This should be obvious in advance because it is 
represented by a vector that has three complex components. That means we have 6 parameters. After gauge and 
normalization, we will still have 4 physical parameters. Hence it is not possible to find all the possible states of spin 1 
by using only rotations. Below we further discuss the physical interpretation of spin 1 states. This discussion suggest 
to use the following notations for the basis states of the standard representation: 



Vn. = 1) = |e.) = I ^) 



(377) 



|m = 0)= |e-,) =\\) 



= -!)= l-e.) =14) 



The first and the last states represent circular polarizations. By rotating the first state by 180° around the Y axis 
we get the third state. This means that we have 180° degree orthogonality. However, the middle state is different: it 
describes linear polarization. Rotating the middle state by 180° degrees around the Y axis gives the same state again! 
This explains the reason for marking this state with a double headed arrow. 

In order to get further insight into the variety of polarization states we can define the following procedure. Given an 
arbitrary state vector (ip^jtpQjtp-) we can re-orient the z axis in a {0,ip) direction such that V'o = 0. We may say 
that this {0,(p) direction define a "polarization plane". Using (/) rotation around the new Z axis, and taking gauge 
freedom into account we can bring the state vector into the form (cos(q), 0, sin((7)), where without loss of generality 
< q < 7r/2. Thus we see that indeed an arbitrary state is characterized by the four parameters {6, (p, 0, q). In the 
most general case we describe the polarization as elliptic: {9, (p) defines the plane of the ellipse, </> describes the angle 
of major axes in this plane, and q describes the ratio of the major radii. It is important to realize that 180° rotation 
in the polarization plane leads to the same state (up to a sign). The special case g = is called circular polarization 
because any rotation in the polarization plane leads to the same state (up to a phase). The special case q = t:/2 is 
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called linear polarization: the ellipse becomes a double headed arrow. Note that in the latter case the orientation of 
the polarization plane is ill defined. 

If we rotate the linear polarization state | 'i^) by 90°, once around the Y axis and once around the X axis, we get an 
orthogonal set of states: 



-1' 

I 11/ . , V/; - ^ , 



|e-.) = i=M^) + l^» - 4 I I (378) 



;;5(liH + l»)";^|o 




This basis is called the linear basis. States of "spin 1" can be written either in the standard basis or in the basis 
of linear polarizations. The latter option, where we have 90° orthogonality of the basis vectors, corresponds to the 
Euclidean representation. 

We can rotate the state | tf) in order to get other circularly polarized states: 



\ee.i 



'i(l +cos6l)e-*'^ 
^ sin 61 
i(l -cos6')e''^ 



(379) 



Similarly, we can rotate the state | 1^) in order to get other linearly polarized states: 



Ui^e,)Ui9ey)\ t) 



-^sin0e-'-N 
cos 9 

v2 



(380) 



The circularly polarized states are obtained by rotating the | ff) state, while the linearly polarized states are obtained 
by rotating the | |[) state. But a general polarization state will not necessarily be circularly polarized, neither linearly 
polarized, but rather elliptically polarized. As explained above it is possible to find one-to-one relation between 
polarization states of spin 1 and ellipses. The direction of the polarization is the orientation of the ellipse. When 
the ellipse is a circle, the spin is circularly polarized, and when the ellipse shrinks down to a line, the spin is linearly 
polarized. 



^== [14.5] Translations and rotations of wavefunctions 

We first consider in this section the space of functions that live on a torus (bagel) . We shall see that the representation 
of translations over this space decomposes into one-dimensional irreducible representations, as expected in the case of 
a commutative group. Then we consider the space of functions that live on the surface of a sphere. We shall see that 
the representation of rotations over this space decomposes as 1 © 3 © 5 © . . .. The basis in which this decomposition 
becomes apparent consists of the spherical harmonics. 

Consider the space of functions that live on a torus (bagel). This functions can represent the motion of a particle in a 
2-D box of size x Ly with periodic boundary conditions. Without loss of generality we assume that the dimensions 
of the surface are Lx = Ly = 27r, and use x = {9, ip) as the coordinates. The representation of the state of a particle 
in the standard basis is: 



1*) = Y,i^{9,^)\e,^) 



(381) 
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The momentum states are labeled as fc = (n^m). The representation of a wavcfunction in this basis is: 

I*) = 51*"^™!"'™) (3^2) 

n,m 

where the transformation matrix is: 

{e,ip\n,m) = e*M+™v) (383) 
The displacement operators in the standard basis are not diagonal: 

i?x,x' = S{e - {9' + a))S{ip - [ip' + h)) (384) 
However, in the momentum basis we will get diagonal matrices: 

Dk.k' = ^„.n'^,n,m'e-'(""+^™) (385) 

In other words, we have decomposed the translations group into 1-D representations. This is possible because the 
group is commutative. If a group is not commutative it is not possible to find a basis in which all the matrices of the 
group are diagonal simultaneously. 

Now we consider the space of functions that live on the surface of a sphere. This functions can represent the motion 
of a particle in a 2-D spherical shell. Without loss of generality we assume that the radius of the sphere is unity. In 
full analogy with the case of a torus, the standard representation of the states of a particle that moves on the surface 
of a sphere is: 

\^) ^Y.'^{e,^)\e,^) (386) 

Alternatively, we can work with a different basis: 

I*) = ;^«',„|£,m) (387) 

where the transformation matrix is: 

(0,^|^,m) = y^"(0,^) (388) 
The "displacement" matrices are actually "rotation" matrices. They are not diagonal in the standard basis: 

i?x,x' = 5{Q - .f{0', ^'m^ - 9{0'. v')) (389) 

where /() and g{) are complicated functions. But if we take y^™(0, Lp) to be the spherical harmonics then in the new 
basis the representation of rotations becomes simpler: 

/I X 1 \ 

R£m,i'm' = Q ^0^5x5 "" diagonal (390) 

\ 

When we rotate a function, each block stays "within itself. The rotation does not mix states that have different £. In 
other words: in the basis \(,m) the representation of rotations decomposes into a sum of irreducible representations 
of finite dimension: 

1©3©5©... (391) 
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In the next section we show how the general procedure that we have learned for decomposing representations, does 
indeed help us to find the (p) functions. 



[14.6] The spherical harmonics 



We have already found the representation of the generators of rotations over the 3D space of wavefunctions. Namely 
we have proved that L = r x p. If we write the differential representation of L is spherical coordinates we find as 
expected that the radial coordinate r is not involved: 



dip 

X,^e--(44-.cot(.)^ 

Id d 1 92 



sm{e) de 



89' sin'ed^^ 



(392) 
(393) 
(394) 



Thus the representation trivially decompose with respect to r, and without loss of generality we can focus on the 
subspace of wavefunctions '5(6', ip) that live on a spherical shell of a given radius. We would like to find the basis in 
which the representation decompose, as defined by 



The solution is: 



2e + i {e-my. 

Att {£ + m)l 



1/2 



[{-irPe,n{cos{e))] e™-^ 



(395) 
(396) 



(397) 



It is customary in physics textbooks to "swallow" the factor (—1)™ in the definition of the Legendre polynomials. We 
note that it is convenient to start with 



Y"{9,(f>) cx {sm{0)Ye'^^ 
and then to find the rest of the functions using the lowering operator: 

\£,m) cx 



(398) 



(399) 



Let us give some examples for Spherical Functions. The simplest function is spread uniformly over the surface of the 
sphere, while a linear polarization state along the Z axis is concentrated mostly at the poles: 



1 



Y 



1,0 



■ cos(e') 



/47r' V 47r 

If we rotate the polar wave function by 90 degrees we get: 



(400) 



■ sin(6') cos((y3). 



■ sin(0) sin((/3) 



While according to the standard "recipe" the circular polarizations are: 



(401) 



yl.l 



■sin(6')e''^. 



Y 



1,-1 



■ sin(6')e" 



-tip 



(402) 



[15] Multiplying representations 
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[15.1] Multiplying representations 



Let us assume we have two Hilbert spaces. One is spanned by the basis \i) and the other is spanned by the basis |a). 
We can muUiply the two spaces "extcrnaUy" and get a space with a basis defined by: 



I, a 



(403) 



The dimension of the Hilbert space that we obtain is the multiphcation of the dimensions. For example, we can 
multiply the "position" space x by the spin space m. We will assume that the space contains three sites x = 1,2, 3, 
and that the particle has spin i with mi = The dimension of the space that we get from the external 

multiplication is 2 x 3 = 6. The basis states are 



\x, m) — \x) (g) \m) 
A general state is represented by a column vector: 



(404) 



I*) 



/*it\ 



^2T 
2i 



(405) 



Or, in Dirac notation: 



(406) 



If X has a continuous spectrum then the common notational style is 



I*) =Y,'^^n{x)\x,m) 



*m(2;) = 



_ f^^ix) 



(407) 



If we prepare separately the position wavcfunction as tpx and the momentum polarization as Xm, then the state of 
the particle is: 



I*) = l^)®lx) 



IpxXn 



/'0iXt\ 

'4>ixi 

"02 Xi 

V'sXT 



(408) 



It should be clear that in general an arbitrary cannot be written as a product state of some 1-0) with some |x). 
In we have a non-factorized preparation of the particle we say that the its space and its spin degrees of freedom were 
entangled. 



[15.2] External multiplication of operators 



Let us assume that in the Hilbert space that is spanned by the basis |a), an operator is defined: A Aap- And in 
another Hilbert space that is spanned by the basis an operator is defined: B Bij. An operator C = B ® A m 
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the new space is defined as follows: 



(409) 



In Dirac notation: 



{ia\C\jl3) = {t\B\j){a\m 



(410) 



For example, let us assume that we have a particle in a three-site system: 




(411) 



ll> 



I2> 



I3> 



If the particle has spin i we must define the position operator as: 



X = X (E)i 



(412) 



That means that: 



x\x, m) 



(413) 



And the matrix representation is: 




/I 0\ 

1 

2 






2 



3 
VO 3/ 



(414) 



The system has a 6 dimensional basis. We notice that in physics textbooks there is no distinction between the notation 
of the operator in the original space and the operator in the space that includes the spin. We must understand the 
"dimension" of the operator representation by the context. A less trivial example of an external multiplication of 
operators: 












I) 


• 


2 


i 









3/ 





/2 
1 





8 4 
V4 8 



8 4\ 
4 8 






4 2 
2 4 
6 3 
3 6/ 



(415) 



Specifically, we see that if the operator that we are multiplying externally is diagonal, then we will get a block diagonal 
matrix. 
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== [15.3] External multiplication of spin spaces 

Let us consider two operators in different Hilbert spaces L and S. The bases of the spaces are Im^), |ms). The 
eigenvalues are = 0, ±1 and = ±1/2. We wiU mark the new states as follows: | -fl-f), | tfi), | |[t)j I I -D-T); 

I -IJ-i) The system has a basis with 6 states. Therefore, every operator is represented by a 6 x 6 matrix. We will define, 
as an example, the operator: 

.h = S,+ L, (416) 
The three operators operate in 6-D space. A mathematician would write it as follows: 

J:c = ^® Sx+ L^®i (417) 
In the next sections we learn how to make the following decompositions: 

2®2 = 1©3 (418) 

2(g)3 = 2©4 

rep. over sphere = 1©3©5©7©... 

The first decomposition will be used in connection with the problem of two particles with spin i , where we can define 
a basis that includes three symmetrical states (the "triplet") and one anti-symmetric state (the "singlet"). The second 
example is useful in analyzing the Zceman splitting of atomic levels. The third example, which is the decomposition of 
the representation of the rotation group over function space, is best illustrated by considering the motion of a particle 
on a spherical shell. 

=== [15.4] Rotations of a Composite system 



We assume that we have a spin £ entity whose states are represented by the basis 

\m^ = + tj (419) 
and a spin s entity whose states are represented by the basis 

\ms = -S... + s) (420) 
The natural [21 + 1) x (2s + 1) basis for the representation of the composite system is defined as 

\mi,ms) = \mi) ®\ms) (421) 
A rotation of the £ entity is represented by the matrix 

R = R^® \ = e-'*^"®i = c-**^"®^ (422) 

We have used the identity f{A) ®l = f{A®\) which is easily established by considering the operation of both sides 
on the basis states \mi,ms)- More generally we would like to rotate both the £ entity and the s entity. This two 
operations commute since they act on different degrees of freedom (unlike two successive rotation of the same entity). 
Thus we get 
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where 

J = L<E)i + i(g)S = L + S (424) 

From now on we use the conventional sloppy notations of physicists as in the last equality: the space over which the 
operator operates and the associated dimension of its matrix representation are implied by the context. Note that in 
full index notations the above can be summarized as follows: 

{mem,\R\m'gm'^) = RLi,m',Ki,,m'^ (425) 
{mims\Ji\mfm'^) = [Li\ (426) 

It is important to realize that the basis states are eigenstates of but not of J^. 

Jz\mi,ms) = {me + ms)\mi,ms) = mj\me,ms) (427) 
J^|mf,ms) = superposition |mf [±1], TOs[=pl]^ (428) 

The second expression is based on the observation that 

J2 = J2 + 1 ( j_ + j_ ) (429) 
J± = L± + S± (430) 

This means that the representation is reducible, and can be written as a sum of irreducible representations. Using 
the conventional procedure we shall show in the next section that 

(2£+l)® (2s + l) = (2|£ + s| + l)® •••® (2|Z-s| + l) (431) 

We shall call it the "addition of angular momentum" theorem. The output of the "addition of angular momentum" 
procedure is a new basis |j, mj) that satisfies 

J^\j,mj) = j{j + l)\j,m,) (432) 
Jz\j,mj) = ■mj\j,mj) (433) 

We shall see how to efficiently find the transformation matrix between the "old" and the "new" bases. The entries 
of the transformation matrix are called the Clebsch-Gordan-Racah Coefficient and are commonly expressed using the 
Wigner 3j-symbol: 



Tra,m.,,m, = {me,m,\j,m,) = (-l)'"'+"V2j+T( (434) 

Note that the Wigner 3j-symbol is non-zero only if its entries can be regarded as the sides of a triangle which is formed 
by 3 vectors of length £ and s and j whose projections mi and m,, and — m sum to zero. This geometrical picture is 
implied by the addition of angular momentum theorem. 

With the Clebsch-Gordan-Racah transformation matrix we can transform states and the operators between the two 
optional representations. In particular, note that is diagonal in the "new" basis, while in the "old basis" it can be 
calculated as follows: 

[J^]oid basis {m'/,,m'Jj'^\me,ms) = {m'i,m'Jj' ,m'.){j' ,m'Aj'^\j,77ij){j,m.j\mi,ms) = r[J^]<ji,g„„ir^ (435) 
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We shall see that in practical applications each representation has its own advantages. 



[15.5] The inefficient decomposition method 



Let us discuss as an example the case £ = I and s = 1/2. In the natural basis we have 
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(436) 



(437) 



etc. In order to find we apparently have to do the following calculation: 



J' 



(438) 



The simplest term in this expression is the square of the diagonal matrix 



Sz —f [6x6 matrix] 



(439) 



We have two additional terms that contain non-diagonal 6x6 matrices. To find them in a straightforward fashion 
can be time consuming. Then we have to diagonalize so as to get the "new" basis. 

In the next section we explain the efficient procedure to find the "new" basis. Furthermore, it is implied by the 
"addition of angular momentum" theorem that 2 = 4© 2, meaning that we have a j = 3/2 subspacc and a j = 1/2 
subspace. Therefore it is clear that after diagonalization we should get. 



J' 
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(15/4) 

(15/4) 

(15/4) 
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This by itself is valuable information. Furthermore, if we know the transformation matrix T we can switch back to 
the old basis by using a similarity transformation. 



[15.6] The efficient decomposition method 



In order to explain the procedure to build the new basis we will consider, as an example, the addition of ^ = 2 and 
s = |. The two graphs below will serve to clarify this example. Each point in the left graph represents a basis state 

const = rrij. 



in the |m^,ms) basis. The diagonal lines connect states that span subspaces. That means mg + ma 



Let us call each such subspace a 
contains only one state. 



floor". The upper floor m. 



s contains only one state. The lower floor also 
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5/2 
1'2 

~~», ~~», ~~» -1/2 

-3/2 
-5/2 

• • • • • 

-7/2 



-2-1 1 2 nij 1/2 3/2 5/2 7/2 j 

We recall that 

Jz\mi,ms) = {rnt + ms)\'mt,ms) (441) 

S-\mi,ms) ~ \/ s(s + 1) — rns{rns — l)\mi, nis — 1) (442) 

L_|m£, m,,) = v^£(£ + 1) - m^(TOf - l)|m^ - 1, m^) (443) 

J_ = S"- + L_ (444) 

J2 = j2 + l(j^j_ + ,/_j^) (445) 



Applying J_ or J-|_ on a state takes us either one floor down or one floor up. By inspection we see that if operates 
on the state in the upper or in the lower floor, then we stay "there". This means that these states are eigenstates of 
j2 corresponding to the eigenvalue j = 1 + s. Note that they could not belong to an eigenvalue j > £ + s because this 
would imply having larger (or smaller) rrij values. 

Now we can use J_ in order to obtain the multiplet of j = £ + s states from the rrij ~ £ + s state. Next we look at 
the second floor from above and notice that we know the \ j = £ + s, rrij ^ 1+ s — 1) state, so by orthogonalization we 
can find the \i = (. + s — 1, rrij = £ + s — 1) state. Once again we can get the whole multiplet by applying J_. Going 
on with this procedure will give us a set of states as arranged in the right graph. 

By suggesting the above procedure we have in fact proven the "addition of angular momentum" statement. In the 
displayed illustration we end up with 4 multiplets (j = |, f , |, 5) so we have 5(8)4 = 8©6©4©2. In the following 
sections we review some basic examples in detail. 

== [15.7] The case of 202 = 301 

Consider the addition of £ = i and 5=5 (for example, two electrons). In this case the "old" basis is 



|m,,m,) = | tt), I Ti), I it), I ii) (446) 
The "new" basis we want to find is 

Ii, m,) = |1,1),|1,0),|1,-1) , IM) (447) 

These states are called triplet and singlet states. It is very easy to apply the procedure as follows: 

= I tt) (448) 

|i,o) « J_| TT) = I TT> + UT> (449) 

cx J_J_| TT> =2| ii) (450) 
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By orthogonaliztion we get the singlet state, which after normahzation is 



|o,o) = -^(|n>-UT)) 

Hence the transformation matrix from the old to the new basis is 
/ 1 \ 

n 1 n 1 



75 75 
/ 75 -75 , 

Vo 1 y 



The operator in the \mi, nis) basis is 
{m'^,m'JJ^\mi,ms) = T 
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m, 



[15.8] The case of 30 2 = 40 2 



Consider the composite system of ^ = 1 and s = ^ . 
In this case the "old" basis is 

|m,,m,) = |^t>,l^i>,|$T>,l$i),I^T),l^i) 
The "new" basis we want to find is 



(454) 



(455) 



It is very easy to apply the procedure as in the previous section. All we have to remember is that the lowering operator 
L_ is associated with the V2 prefactor: 



li,i> = I^T) 

(X J-|^T> = l^i> + V2|^t> 

(X J_J_|^T) =2V2|$i) + 2|^T> 



(456) 
(457) 
(458) 
(459) 
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By orthogonalization wc get the starting point of the next multiplet. and then we use the lowering operator again: 



1^,^) (X -V2\ti) + \t1) 



(460) 
(461) 



Hence the transformation matrix from the old to the new basis is 
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and the operator in the \mi, nis) basis is 
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{m'i,m'Jj'^\mi,ms) =T 
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This calculation is done in the Mathematica file zeeman.nb. 
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[15.9] The case of (2£ + 1) 2 = {2i + 2) {2i) 



The last example was a special case of a more general result which is extremely useful in studying the Zeeman Effect 
in atomic physics. We consider the addition of integer £ (angular momentum) and s — ^ (spin). The procedure is 



exactly as in the previous example, leading to two multiplets: The j 
The final expression for the new basis states is: 



h multiplet and the j 



h multiplet. 



j = e--,m} = -a 



1 t 



m , t 

2 



(464) 
(465) 
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where 



. ^ ^^^^ 

The =b signs are for the respective two multiplets. The transformation matrix between the bases has the structure 
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[16] Galilei group and the non-relativistic Hamiltonian 
== [16.1] The Representation of the GaUlei Group 

The defining realization of the Galilei group is over phase space. Accordingly, that natural representation is with 
functions that "live" in phase space. Thus the a-translated p{x, v) is p{x — a, v) while the u-boosted p{x, v) is p{x, v~u) 
etc. 

The generators of the displacements are denoted Pa;,Pj,,Pz, the generators of the boosts are denoted Qa;,Qj^,Qz, 
and the generators of the rotations are denoted J^;, Jj,, J^. Thus we have 9 generators. It is clear that translations 
and boosts commute, so the only non-trivial structure constants of the Lie algebra have to do with the rotations: 



[P,,P,]= (469) 

[Q„Q,]= (470) 

[Pj,Qj]=0 (to be discussed) (471) 

[J„ A,] = ieyfcAfc for A = P,Q,J (472) 



Now we ask the following question: is it possible to find a faithful representation of the Galilei group that "lives" 
in configuration space. We already know that the answer is "almost" positive: We can represent pure quantum 
states using "wavefunctions" ip{x). These wavefunctions can be translated and rotated. On a physical basis it 
is also clear that we can talk about "boosted" states: this means to give the particle a different velocity. So we 
can also boost wavefunctions. On physical grounds it is clear that the boost should not change |7/'(a;)p. In fact 
it is not difficult to figure out that the boost is realized by a multiplication of ipi^) by c'^"^")^. Hence we get the 
identifications i-^- —i{d/dx) and Qx ^-^ —ma; for the generators. Still the wise reader should realize that in this 
" new" representation boosts and translations do not commute, while in case of the strict phase space realization they 
do commute! 

On the mathematical side it would be nice to convince ourselves that the price of not having commutation between 
translations and boosts is inevitable, and that there is a unique representation (up to a gauge) of the Galilei group using 
" wavefunctions" . This mathematical discussion should clarify that the " compromise" for having such a representation 
is: (1) The wavefunctions have to be complex; (2) The boosts commute with the translations only up to a phase factor. 
We shall sec that the price that we have to pay is to add 1 as a tenth generator to the Lie algebra. This is similar 
to the discussion of the relation between SO (3) and SU(2). The elements of the latter can be regarded as "rotations" 
provided we ignore an extra " sign factor" . Here rather than ignoring a " sign factor" we have to ignore a complex 
" phase factor" . 

Finally, we shall see that the most general form of the non-relativistic Hamiltonian of a spinless particle, and in 
particular its mass, are implied by the structure of the quantum Lie algebra. 

== [16.2] The Mathematical Concept of Mass 

An element r of the Galilei group is parametrized by 9 parameters. To find a strict (unitary) representation means 
to associate with each element a linear operator U{t) such that (g) = implies 

U{t^)U{t^) = U{t^) (473) 

Let us see why this strict requirement cannot be realized if we want a representation with " wavefunctions" . Suppose 
that we have an eigenstate of P such that P\k) = k\k). since we would like to assume that boosts commute with 
translations it follows that also Uboost\k) is an eigenstate of P with the same eigenvalue. This is absurd, because 
it is like saying that a particle has the same momentum in all reference frames. So we have to replace the strict 
requirement by 



[/(ri)[/(r2) =e"Pl^a^e[/(r3) 



(474) 
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This means that now we have an extended group that "covers" the Gahlei group, where we have an additional 
parameter (a phase), and correspondingly an additional generator (1). The Lie algebra of the ten generators is 
characterized by 

[G^,G,]=z^c^,Ga (475) 

A 

where Gq ^ 1 ^^nd the other nine generators are Pi,Qi, Ji with i = x,y, z. It is not difficult to convince ourselves 
that without loss of generality this introduces one "free" parameter into the algebra (the other additional structure 
constants can be set to zero via appropriate re-definition of the generators). The "free" non-trivial structure constant 
m appears in the commutation 

[P„ Qj] = im<5,y (476) 

which implies that boosts do not commute with translations. 

== [16.3] Finding the Most General Hamiltonian 



Assume that we have a spinless particle for which the standard basis for representation is |a;). With appropriate gauge 
of the X basis the generator of the translations is P i-^ —i{d/dx). From the commutation relation [P,Q] = im we 
deduce that Q = —mx + g{p), where g{) is an arbitrary function. With appropraite gauge of the momentum basis we 
can assume Q = —mx. 

The next step is to observe that the effect of a boost on the velocity operator should be 

Uboostiu)~'^vUboostiu) ^ V + u (477) 

which implies that [Q,v] — —i- The simplest possibility is v —p/m. But the most general possibility is 

i) = L{p^A{x)) (478) 
m 

where A is an arbitrary function. This time we cannot gauge away A. 

The final step is to recall the rate of change formula which implies the relation v = i[H,x]. The simplest operator 
that will give the desired result for v is H — ~ ^{x))"^ ■ But the most general possibility involves a second 

undetermined function: 

'H=^ip-A{x))' + V{x) (479) 



Thus we have determined the most general Hamiltonian that agrees with the Lie algebra of the Galilei group. In the 
next sections we shall see that this Hamiltonian is indeed invariant under Galilei transformations. 
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[17] Transformations and invariance 
== [17.1] Transformation of the Hamiltonian 

First we would like to make an important distinction between passive ["Heisenberg"] and active [" Schrodinger" ] points 
of view regarding transformations. The failure to appreciate this distinction is an endless source of confusion. 

In classical mechanics we are used to the passive point of view. Namely, to go to another reference frame (say a 
displaced frame) is like a change of basis. Namely, we relate the new coordinates to the old ones (say x = x — a), and 
in complete analogy we relate the new basis |i) to the old basis \x) by a transformation matrix T = e~*°P such that 
|a;) = T\x) = \x + a). 

However we can also use an active point of view. Rather than saying that we " change the basis" we can say that we 
"transform the wavefunction" . It is like saying that "the tree is moving backwards" instead of saying that "the car is 
moving forward". In this active approach the transformation of the wavefunction is induced by S" = T~^, while the 
observables stay the same. So it is meaningless to make a distinction between old (x) and new (i) coordinates! 

From now on we use the more convenient active point of view. It is more convenient because it is in the spirit of the 
Schrodinger (rather than Heisenberg) picture. In this active point of view observables do not transform. Only the 
wavefunction transforms ("backwards"). Below we discuss the associated transformation of the evolution operator 
and the Hamiltonian. 

Assume that the transformation of the state as we go from the "old frame" to the "new frame" is if.' = Sijj. The 
evolution operator that propagates the state of the system from to to t in the new frame is: 

Uit,to) = Sit)Uit,to)S-\to) (480) 

The idea is that we have to transform the state to the old frame (laboratory) by S~^, then calculate the evolution 
there, and finally go back to our new frame. We recall that the Hamiltonian is defined as the generator of the 
evolution. By definition 

U(t + 5t, to) = (1 - i6m{t))U{t, to) (481) 



Hence 



n 



Sito)U-'Sit)-^ 



(482) 



and we get the result 



n = sns-^ + i—s-^ 

ot 



(483) 



In practice we assume a Hamiltonian of the form TL = h{x,p; V, A). Hence we get that the Hamiltonian in the new 
frame is 

n = h(SxS-'^,SpS^^;V,A)+i—S-'^ (484) 

ot 

Recall that "invariance" means that the Hamiltonian keeps its form, but the fields in the Hamiltonian may have 
changed. So the question is whether we can write the new Hamiltonian as 



'H = h{x,p-A,V) 



(485) 
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To have "symmetry" rather than merely "invariance" means that the Hamiltonian remains the same with A = A and 
V = V . We are going to show that the following Hamiltonian is invariant under translations, rotations, boosts and 
gauge transformations: 

n = ^(f>-A{x))\v{x) (486) 

We shall argue that this is the most general non-relativistic Hamiltonian for a spinless particle. We shall also discuss 
the issue of time reversal (anti-unitary) transformations. 



[17.2] Invariance Under Translations 



T 
S 



D{a) 



'lap 



lap 



(487) 



The coordinates (basis) transform with T , while the wavefunctions are transformed with S. 

SxS-^ =x + a (488) 
Sfix,p)S-' = fiSxS-\SpS-') = f{x + a,p) 

(489) 

Therefore the Hamiltonian is invariant with 

V{x) = V{x + a) (490) 
A{x) = A{x + a) 



[17.3] Invariance Under Gauge 



T = e-''^^^) (491) 

S ^ e*'^^^) 

SxS~^ = X 

SpS-^ = p- WA{x) 

Sf{x,p)S-' = fiSxS-\SpS-') = f{x,p^ VA(x)) 
Therefore the Hamiltonian is invariant with 

V{x) = V{x) (492) 
A{x) = ^(a;) + VA(a;) 



Note that the electric and the magnetic fields are not affected by this transformation. 
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More generally we can consider time dependent gauge transformations with A{x,t). Then we get in the "new" 
Hamiltonian an additional term, leading to 

V{x) = V{x)-{d/dt)K{x,t) (493) 
A{x) ^ A{x) +VK{x,t) 

In particular we can use the very simple gauge K ~ ct in order to change the Hamiltonian by a constant {Ti — TL — c). 

17.4] Boosts and Transformations to a Moving System 



From an algebraic point of view a boost can be regarded as a special case of gauge: 

T = e*(™')"= (494) 

SxS^^ = X 
SpS^^ = p + vnu 

Hence V{x) = V{x) and A{x) = A{x) — mu. But a transformation to a moving frame is not quite the same thing. The 
latter combines a boost and a time dependent displacement. The order of these operations is not important because 
we get the same result up to a constant phase factor that can be gauged away: 

S — giphase(M) ^-i{mu)x ^i{ut)p (495) 

SxS~^ = X + ut 
SpS^^ = p + mu 

The new Hamiltonian is 

f)Q 1 

H = SnS-^ + i-g^S-^ = SnS-^ -up^ —{p - Aix))"^ + V{x) + const(u) (496) 

where 

V{x,t) = V{x + ut,t)-u-A{x + ut,t) (497) 
A{x,t) ^ A{x + ut,t) 

Thus in the new frame the magnetic field is the same (up to the displacement) while the electric field is: 
8 A 

£ = -— -WV = £ + UX B (498) 
ot 

In the derivation of the latter we used the identity 

V(u ■A)-{u- V)A = ux{V X A) (499) 

Finally we note that if we do not include the boost in 5", then we get essentially the same results up to a gauge. By 
including the boost we keep the same dispersion relation: If in the lab frame A ~ and we have v = p/m, then in 
the new frame we also have ^ = and therefore v ~ p/m still holds. 



[17.5] Transformations to a rotating frame 



Let us assume that we have a spinless particle held by a a potential V{x). Assume that we transform to a rotating 
frame. We shall see that the transformed Hamiltonian will have in it a Coriolis force and a centrifugal force. 
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The transformation that wc consider is 

S = e'(^*)-^ (500) 
The new Hamiltonian is 

n = SHS-^ +i—S-^ = —p^ + V{x)-n-L (501) 

It is imphcit that the new x coordinate is relative to the rotating frame of reference. Without loss of generality we 
assume ft = (0,0, £7). Thus we got Hamiltonian that looks very similar to that of a particle in a uniform magnetic 
field (see appropriate lecture): 

n = ±(p-A{x)r + V{x) = ^-^L^ + ^ia:^+y^) + Vix) (502) 

The Coriolis force is the "magnetic field" B — 2mil. By adding and subtracting a quadratic term we can write the 
Hamiltonian Ti. in the standard way with 

V = V -^mn^{x^ +y^) (503) 
A ^ A + md X r 

The extra — (l/2)mil^(a;^ + y^) term is called the centrifugal potential. 



=== [17.6] Time Reversal transformations 

Assume for simplicity that the Hamiltonian is time independent. The evolution operator is U = e~'^*. If we make a 
transformation T we get 

U = T-ie-'«*r = e-'(^"^^)* = e-''^* (504) 

where Ti. = T^^JiT. Suppose we want to reverse the evolution in our laboratory. Apparently we have to engineer 
T such that T^^HT = —H. If this can be done the propagator U will take the system backwards in time. Wc can 
name such T operation a "Maxwell demon" for historical reasons. Indeed for systems with spins such transformations 
have been realized using NMR techniques. But for the "standard" Hamiltonian there is a fundamental problem. 
Consider the simplest case of a free particle H = p^/{2m). To reverse the sign of the Hamiltonian means to make 
the mass m negative. This means that there is an unbounded spectrum from below. The feasibility of making such a 
transformation would imply that a physical system cannot get into thermal equilibrium. 

Digression: when Dirac found his Lorentz invariant Hamiltonian, it came out with a spectrum that had an unbounded 
set of negative energy levels. Dirac's idea to "save" his Hamiltonian was to assume that all the negative energy levels 
are full of particles. Thus we have a meaningful ground state. If we kick a particle from a negative to a positive 
energy level we create an electron-positron pair, which requires a positive excitation energy. 

But wc know from classical mechanics that there is a transformation that reverses the dynamics. All wc have to do 
is to invert the sign of the velocity. Namely p t-^ — p while x i-^- x. So why not to realize this transformation in 
the laboratory? This was Loschmidt's claim against Boltzman. Boltzman's answer was "go and do it". Why is it 
"difficult" to do? Most people will probably say that to reverse the sign of an Avogadro number of particles is tough. 
But in fact there is a better answer. In a sense it is impossible to reverse the sign even of one particle! If we believe 
that the dynamics of the system are realized by a Hamiltonian, then the only physical transformations are proper 
canonical transformations. In quantum mechanical language we say that any physical realizable evolution process is 
described by a unitary operator. We are going to claim that the transformation p t-^ —p while x t-^ x cannot be 
realized by any physical Hamiltonian. The time reversal transformations that we arc going to discuss are anti-unitary. 
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They cannot be reahzed in an actual laboratory experiment. This leads to the distinction between " microreversibility" 
and actual " reversibility" : It is one thing to say that a Hamiltonian has time reversal symmetry. It is a different story 
to actually reverse the evolution. 

Assume that we have a unitary transformation T such that TpT^^ = — p while TxT^^ = x. This would imply 
T[x,p]T~^ = — So we get i ~ —i. This means that such a transformation does not exist. But there is a way out. 
Wigner has proved that there are two types of transformations that map states in Hilbert space such that the overalp 
between states remains the same. These are either unitary transformations or antiunitary transformations. We shall 
explain in the next section that the "velocity reversal" transformation can be realized by an antiunitary rather than 
unitary transformation. We also explain that in the case of an antiunitary transformation we get 

U = r-ie-'«*r = e+*(^"'^^)* = e-''^* (505) 

where Ti = —T^^HT. Thus in order to reverse the evolution we have to engineer T such that T~^'HT — H, or 
equivalently [7i, T] = 0. If such a T exists then we say that Ti has time reversal symmetry. In particular we shall 
explain that in the absence of a magnetic field the non-relativistic Hamiltonian has a time reversal symmetry. 

== [17.7] Anti-unitary Operators 



An anti-unitary operator has an anti-linear rather than linear property. Namely, 

T {a 10) + P IV)) ^ a*T \cf,) + (3*T |V) (506) 

An anti-unitary operator can be represented by a matrix Tij whose columns are the images of the basis vectors. 
Accordingly \ip) = T\ip) implies (pi = Tijip* . So its operation is complex conjugation followed by linear transformation. 

It is useful to note that H = T~^HT imphes Hfj,i, = T*^H*jTj^. This is proved by pointing out that the effect of double 
complex conjugation when operating on a vector is canceled as far as its elements are concerned. 

The simplest procedure to construct an anti-unitary operator is as follows: We pick an arbitrary basis \r) and define 
a diagonal anti- unitary operator K that is represented by the unity matrix. Such operator maps ipr to ip*, and has 
the property = 1. Note also that for such operator H^j = H*j. In a sense there is only one anti-unitary operator 
per choice of a basis. Namely, assume that T is represented by the diagonal matrix {e"^''}. That means 

T \r) ^ c"!''' \r) (507) 

Without loss of generality we can assume that (p,- = 0. This is because we can gauge the basis. Namely, we can define 
a new basis |f) = e^^^^lr) for which 

T|f) =e*('^'-2^-) |f) (508) 

By setting Ar = 0r/2 we can make all the eigenvalues equal to one, and hence T ^ K. Any other antiunitary 
operator can be written trivially as T = [TK)K where TK is unitary. So in practice any T is represented by complex 
conjugation followed by a unitary transformation. Disregarding the option of having the "extra" unitary operation, 
time reversal symmetry T^^JiT ~ Ti means that in the particular basis where T is diagonal the Hamiltonian matrix 
is real (7i* ^ ~ Ti.r,s), rather than complex. 

Coming back to the "velocity reversal" transformation it is clear that T should be diagonal in the position basis (x 
should remain the same) . Indeed we can verify that such a T automatically reverses the sign of the momentum: 

\k)=Y,c^'^^\x) (509) 

X 

T\k) = ^Te*'^^ \x) = ^e-*'==^ \x) = \-k) 
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In the absence of a magnetic field the kinetic term in the Hamiltonian has symmetry with respect to this T. 
Therefore we say that in the absence of a magnetic field we have time reversal symmetry. In which case the Hamiltonian 
is real in the position representation. 

What happens if we have a magnetic field? Does it mean that there is no time reversal symmetry? Obviously in 
particular cases the Hamiltonian may have a different anti-unitary symmetry: if V{~x) = V{x) then the Hamiltonian 
is symmetric with respect to the transformation x i-^ —x while p i—f p. The anti-unitary T in this case is diagonal in 
the p representation. It can be regarded as a product of "velocity reversal" and "inversion" {x t-^ —x and p t-^ ^p)- 
The former is anti-unitary while the latter is a unitary operation. 

If the particle has a spin we can define K with respect to the standard basis. The standard basis is determined by 
X and (73 . However, T = K is not the standard time reversal symmetry: It reverse the polarization if it is in the 
Y direction, but leave it unchanged if it is in the Z or in the X direction. We would like to have T^^aT = —a. This 
implies that 

T = e-''''^yK = -iuyK (510) 

Note that = where N is the number of spin 1/2 particles in the system. This implies Kramers degeneracy 

for odd A^. The argument goes as follows: If i/; is an eigenstate of the Hamiltonian, then symmetry with respect to T 
implies that also Tip is an eigenstate. Thus we must have a degeneracy unless Tip = A-^, where A is a phase factor. 
But this would imply that T'^-ip = \^ip while for odd N we have T^ = —1. The issue of time reversal for particles with 
spin is further discussed in [Messiah p. 669]. 



Dynamics and Driven Systems 
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[18] Probabilities and FGR rates of transitions 
== [18.1] Time dependent Hamiltonians 



To find the evolution which is generated by a time independent Hamiltonian is relatively easy. Such a Hamiltonian 
has eigcnstatcs |n) which arc the "stationary" states of the system. The evolution in time of an arbitrary state is: 

l*W)=E^"''''''V'n|n) (511) 

n 

But in general the Hamiltonian can be time-dependent \H{ti) ^^{{t^)] ^ 0. In such case the strategy that was described 
above for finding the evolution in time loses its significance. In this case, there is no simple expression for the evolution 
operator: 

U{t) = {I - idtnUitn)) ■ ■ ■ {I - idt2n{t2)){l - idtin{ti)) ^ e^'-^o-«(*')dt' (512) 

We are therefore motivated to develop different methods to deal with driven systems. Below we assume that the 
Hamiltonian can be written as a sum of a time independent part TYq and a time dependent perturbation. Namely, 

n ^ Ha + V ^ Ho + f{t)W (513) 



== [18.2] The interaction picture 

We would like to work in a basis such that TCq is diagonal: 



Ho\n) = E^\n) (514) 

71 

The evolution is determined by the Schrodinger's equation: 

i^=E„4'n+J2Vnn''^n' (515) 

n' 

which can be written in a matrix style as follows: 



/l-i \ /^i^-i \ /Vn Vi2 

. a 
I — 
dt 



^2 = ^2*2 \ <\V21 V22 



\ 




(516) 



Without the perturbation we would get ipnit) = CnB where c„ are constants. It is therefore natural to use the 

variation of parameters method, and to write 

*«(t) = c„(i)e-'^"* (517) 
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In other words, we represent the "wave function" by the amphtudes Cn{t) = ^/'„(t)e*^"* rather than by the ampUtudcs 
^'„(t). The Schrodinger's equation in the new representation takes the form 

^^ = E«''''""''"'^'^»»'C"'W (518) 



This is called the Schrodinger's equation in the "interaction picture". It is a convenient equation because the term 
on the right is assumed to be "small". Therefore, the amplitudes c„(i) change slowly. This equation can be solved 
using an iterative scheme which leads naturally to a perturbative expansion. The iteration are done with the integral 
version of the above equation: 



Cn(t) = c„(0) - 



Vn, 



,{t')dt' 



(519) 



where Emi' = En — En'. In each iteration we get the next order. Let us assume that the system has been prepared 
in level uq. This means that the zero order solution is 

cM(0 = C„(0) = 6n,no (520) 



We iterate once and get the first-order solution: 



At) 



(521) 



So that the leading order is: 



Cn{t) ~ 1 forn — no (522) 



Cr) 



:„(t) = -iWn,no I f{t')e'^"'^ot'dt' Otherwise 







We notice that for very short times t ^ 1/ {En — En' ) we get c„(t) « —iVn,not which reflects the definition of the matrix 
elements of the Hamiltonian as a " hopping" amplitude per unit of time. For longer times the hopping amplitude is 
multiplied by a factor that oscillates at a frequency En — En' ■ This factor makes it " harder" for the particle to move 
between energy levels, since it does not allow the particle to "accumulate" amplitude. 

In order to illustrate the effect of the oscillating factor, consider a problem in which the unperturbed Hamiltonian is 
"to be in some site". We consider the possibility to move from site to site as a perturbation. The energy differences 
in the site problem are the "potential differences". Let us assume we have a step potential. Even though the 
hopping amplitude is the same in each "hop" (even if the hop is through a wall), the probability amplitude does not 
"accumulate". What stops the particle from crossing the step is the potential difference between the sites. 

== [18.3] The transition probability formula 

The expression we found for the transition amplitude using first-order perturbation theory can be written as: 



Cn{t) ~ ~lWn^nJT[f{t)] (523) 

Therefore, the transition probability is: 

Pt{n\m) « |W„,™p |FT[/(t)]|' (524) 
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Where the Fourier transform is defined by: 

/•OO 

FT[/(i)]= / /(t')c'^-*'di' (525) 



And we use the convention that f{t) = before and after the pulse. For example, if we turn on a constant perturbation 
for a certain amount of time, then f{t) is a rectangle function. 

== [18.4] The effect of a constant perturbation 



Wc consider the following scenario: A particle is prepared in the state rig, and then a constant perturbation is turned 
on for a time t. We want to know what is the probability of finding the particle at some later time in the state n. 
Using the transition probability formula we get 

Cn{t) = Wnno J, _j, (526) 

We notice that the transition amplitude is larger to closer levels and smaller for distant levels. 

1 _ ei(is„--E„o)* ^ 



Ptin\n„) = \Cn{t)\' = \Wnnor 



(527) 



In the next section we shall see that this expression can be regarded as a special case of a more general result. 



== [18.5] The effect of periodic driving 

We will now discuss a more general case: 

/(t') = e^'"*' for t' e [0,t] (528) 

We notice that the Hamiltonian should be hermitian. Therefore this perturbation has a physical meaning only if 
it appears together with a conjugate term 6+'^* . In other words, the driving is done by a real field cos{flt') that 
changes periodically. Below we will treat only "half of the perturbation. We can get the effect of the second half by 
making the swap Q t-^ — O. The calculation is done the same way as in the case of a constant perturbation. Using 
the transition probability formula we get 



Ptin\no) = \Wr,no\' 



1 _ g»(B„-B„o-n)t [2 



E„ — Eng — 51 
A more convenient way of writing this expression is: 



(529) 



P,,„i,,) . iw-,„.„r ^"-7'^'--^'»-"'"i . -/■■•) (530) 

[En - Eno - i^n " En^ - 

And another optional notation is: 

Pt{n\no) = \Wn,na\''t''smc''iiEn~En,-n)t/2) = 27t \Wn^„,\'' 52^/tiEn - E^, ~ Q) x t (531) 



Where: 



3inc(.) ^ ^ (532) 
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We have used the notation Se{uj) for a narrow function with width e. 



[18.6] The effect of a noisy driving 



In the previous subsection we have derived an expression for the transition probabihty if the driving is purely harmonic. 
In this section we assume a noisy driving. The "noise" is characterized by a correlation function 

{fit)fit'))^F{t-t') (533) 

where the average is taken over realizations of f{t). The Fourier transform of of the correlation function is the power 
spectrum of the noise F{uj). From the transition probability formula we get 

Ptin\m) = \Wnm\^ f f\f{t')fit")y''-^''-'"'>dt'dt" « \Wnm\^Fiu;=E^rn)xt (534) 



where the approximation assumes that the duration of the noise much larger compared with its correlation time. 
The intensity of the noise F{0) = (/^) = is commonly characterized by its RMS value e. Accordingly the 'area' 
under the power spectrum F{lij) is 2Tr£^. Of particular interest is the case of driving by low frequency noise. In such 
case F(uj) is very narrow. Its spectral width 2tt /tc, is determined by the correlation time Tc- More generally we 
can consider a noisy harmonic driving, for which F(lu) is concentrated around some frequency Q. In such case it is 
suggestive to re-write the above result as 

Pt{n\no) « 2Tre^ \Wn,no\^ S^^/rSi-^n - ErJ -Q)xt (535) 

Note that e can be absorbed into the definition of W. The above result for constant rate transition between levels is 
commonly regarded as a special version of the Fermi golden rule that we discuss in the next section. 

= [18.7] The Fermi golden rule (FGR) 



The main transitions which arc induced by a purely harmonic driving is to energy levels that obey the "resonance 
condition" : 

iEn~En,)^hn (536) 

From the expression we found, we see that the probability of transition to levels that obey \En — {Eng + ^)\ < 2TTh/t 
is proportional to t^. That is what we would expect to get by the definition of the Hamiltonian as the probability 
amplitude for transitions per unit time. But the "width" of the area that includes these levels is proportional to 
27rh/t. From this we conclude that the probability of transition to other levels grows linearly. We will call the rate of 
transition to other levels F. 

r = J|W„,„J2 = 27r g{E)\Wn^rJ^ (537) 
The formula can be proved by calculating the probability to stay in level no: 

P{t) = 1 - ^ Ptin\no) = 1- J ^P,{E\Eo) = 1 - '^\Wn,no\' = 1 - Fi (538) 



It is implicit in the above derivation that we assume a dense spectrum with well defined density of states. We also 
assume that the relevant matrix elements are all of the same order of magnitude. 
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Let us discuss the conditions for the vahdity of the Fermi golden rule picture. First-order perturbation theory is vahd 
while P(t) « 1, or equivalently Tt <^ 1. An important time scale that gets into the game is the Heiscnberg time which 
is defined as: 

iff = — (539) 
We will distinguish below between the case of weak perturbation {\W\ <C A) from the case of strong perturbation 

{m>A). 

In the case ^ A perturbation theory is still valid when t ~ tn- If perturbation theory is valid up to this time 
then it is valid at any time, since after the Heisenberg time the (small) probability that has moved from the initial 
state to other energy levels oscillates, and does not grow further. This argument is based on the assumption that the 
difference (i?„ — [Eng + ri)) is of the order A, even for levels in the middle of the resonance. If there is an "exact" 
resonance, it is possible to show that the probability will oscillate between the two energy levels, as in the "two-site" 
problem, and there is no "cumulative" leakage of the probability to other levels. 

We will now discuss the case A < \W\. In this case, first-order perturbation theory breaks down before the Heisenberg 
time. Then we must go to higher orders of perturbation theory. With some limitation we find the result: 

P{t) = e-^* (540) 

This means that there is a decay. In another lecture we analyze a simple model where we can get this result exactly. 
In the general case, this is an approximate result that is valid (if we are lucky) for times that are neither too long nor 
too short. 



[19] Master equations 



The FGR is commonly used in order to determine the rates constants in Master equations that provide a reduced 
description for the dynamics of a system that is coupled to a bath or to a noise source. Here we demonstrates how 
it emerges in the latter case. The Hamiltonian is H = Hq + f{t)W, were f{t) represents white: that means that 
upon ensemble average (/(i)) = 0, while {fit) fit')) = v5{t — t'). The Liouville von-Neumann equation for the time 
evolution of p can be solved iteratively in order to determine p{t + dt) given pit), where dt is small time interval: 

p{t + dt) = p~i dt' [n, p\~ / dt'dt" [n, [n, p]] + ... (541) 



Averaging over realizations of f{t) all the odd orders in this expansion vanish, while the leading dt contribution comes 
only from the zero order term that involves Ho and from the second order term that involves W. Consequently we 
get the following Master equation: 

^ = -t[Ho,p]-^iy[W,[W,p]] (542) 

In the most standard example W — x, and the second term corresponds to the diffusion term (y /2)d'^ p/ dp^ in the 
classical Fokker-Plank equation. 

In the weak noise limit the rate of the noise induced transitions becomes much smaller compared with the Bloch 
frequency of the coherent transitions. Then we can ignore all the highly oscillating terms that involve the off-diagonal 
terms, because they average to zero. Consequently we get the so-called Pauli mater equation for the probabilities p„ 

^ = Wp I -Fs ... I (543) 
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In this equation the W matrix is in agreement with the FGR picture. Namely, the rate constants are Wnm = v\Wnm\'^ : 
and the decay constants are = Wnm- We note that if we had interaction with a bath, then the ratio ^12/^21 
would not be unity, but would favor downwards transitions. 

The noise induced "diffusion" term in the master equation that we have derived is WpW — {l/2){WWp + pWW). 
We can regard it as a special case of the so-called Lindblad equation: 

^ = -^[^L,,p\ + Y,[W^]p[W^]^ - \[Tp + pT] (544) 

r 

where T = X)?-!^'"]^!^'^]- Lindblad equation is the most general form of a Markovian master Equation for the 
probability matrix. The heart of the argument is as follows: The most general linear relation between matrices is 

Pa/3 = ^W(a/3|a'/3')Pa'/3' (545) 

a'P' 

This linear transformation is required to preserve the hcrmiticity of p. Changing notation to W{a(3\a' (3') — yVaa'.pfS' 
one observes that Waa',i3p' should be hermitian, with eigenvalues Wr- Accordingly we can find a spectral decomposition 
with a transformation matrix T{aa'\r). Changing notation to ^, = yi'w^T (aa' \r) we get the Lindblad from 
Xj,[W^'"]p['H^'']^. Now we have to distinguish between two cases: If p is the new probability matrix, then conservation 
of trace requires Xirl^'^]^!^'^] ~ "-"^ other hand we look on the incremental change p, then (after some 

non-trivial algebra) we arrive to the Lindblad from of the Master equation. There is a complete analogy here with 
the Pauli equation where the decay rates in the main diagonal of W have to counter-balance the ingoing transitions. 
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[20] The cross section in the Born approximation 
= [20.1] Cross Section 

In both classical mechanics and quantum mechanics there are two types of problems: closed systems and open systems. 
We will discuss an open system. The dynamical problem that we will analyze is called a " scattering problem" . For 
example, a wave that is scattered on a sphere. In a problem of this type the energy is given. We assume that there 
is an " incident particle flux" and we ask what is the " scattered flux" . 




shadow 



We notice that the sphere "hides" a certain area of the beam. The total hidden area is called the "total cross 
section" CT^otah assume that we have a beam of particles with energy E and velocity ve, so that the current 

density is: 



J[particles/time/area] = pqVe (546) 
Where po is the particle density. We write the scattered current as: 



^scattered = I'^totall ^ (547) 

Where the cross section cTj^otal defined as the ratio of the scattered current ^scattered incident particle flux 

density J. We notice that each area element of the sphere scatters to a different direction. Therefore, it is more 
interesting to talk about the differential cross section a{il,). In full analogy a{ft)dil is defined by the formula: 

^scattered = W^d^] x J (548) 
Where *gcattered current that is scattered into the angular element dVl. 



== [20.2] Cross section and rate of transition 

For the theoretical discussion that will follow, it is convenient to think of the space as if it has a finite volume 
= LxLyLz with periodic boundary conditions. In addition we assume that the "incident" beam takes up the whole 
volume. If we normalize the particle density according to the volume then po = With this normalization, the 

flux J (particles per unit time) is actually the "probability current" (probability per unit time), and the current 
^scattered ^^'^ scattering rate. Therefore an equivalent definition of the cross section is: 

T{k e dn\ko) = [a{^)(m] X -^VE (549) 

Given the scattering potential U{r) we can calculate its Fourier transform U{q) 



U{q) = FT[U{r)] = / / / C/(r)e-'« *'ciV 



(550) 
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Then we get from the Fermi golden rule (see derivation below) a formula for the differential cross seetion which is 
called the " Born approximation" : 



(2^)2 \VE 



— \U{k~ko) 



(551) 



The second expression assumes the non-rclativistic dispersion relation ve = kE/m. The Born approximation is a 
first-order perturbation theory approximation. It can be derived with higher order corrections within the framework 
of scattering theory. 



[20.3] The DOS for a free particle 



In order to use the Fermi golden rule we need an expression for the density of states of a free particle. In the past 
we defined g{E)dE as the number of states with energy E < Ek < E + dE. But in order to calculate the differential 
cross section we need a refined definition: 



g{E, Q,)dEdil Number of states withi; < Ek < E + dE and k e dVl 



(552) 



If we have a three-dimensional space with volume ~ L^LyLz and periodic boundary conditions, then the momentum 
states are: 



27r 



27r 2n 



J^y 



(553) 



The number of states with a momentum that in a specified region of k space is: 



dkx dky dk^ 

2-n 2-K 2tt 



-d^k 



-k^dkdfl 



L3 



■kl—dn 



(554) 



(2^)2 (27r)3 (27r)3 ve 

Where we have moved to spherical coordinates and used the relation dE = VEdk. Therefore, we find the result: 

g{E,n)dEdn = j—r^dEdn (555) 



(2^)3 VE 
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== [20.4] Derivation of the Born formula 

Let us assume that we have a flux of particles that are moving in a box with periodic boundary conditions in the 
z direction. As a result of the presence of the scatterer there arc transitions to other momentum states (i.e. to other 
directions of motion). According to the Fermi golden rule the transition rate is: 

r(A:edr2|fco) = 27r[g(£:,rj)dl7] |[/fc,fcj2 (556) 

By comparing with the definition of a cross section wc get the formula: 

2-77 

aiVL) = —L" Q[E,n)\UuM,\^ (557) 
We notice that the matrix elements of the scattering potential are: 

(fc|[/(r)|fc;) = y^e-*'-C/(r)e^'=« '- = ^ J Uir) e-''-''-'''>>^d'r ^ ^U{k - h) (558) 
By substituting this expression and using the result for the density of states we get the Born formula. 



== [20.5] Scattering by a spherically symmetric potential 

In order to use the Born formula in practice we define our system of coordinates as follows: the incident wave 
propagates in the z direction, and the scattering direction is = {Oii,(pii). The difference between the k of the 
scattered wave and the fco of the incident wave is q = k — kQ. Next we have to calculate [/((?) which is the Fourier 
transform of U{r). If the potential is spherically symmetric we can use a rotated coordinate system for the calculation 
of the Fourier transform integral. Namely, we can use spherical coordinates such that 9 = is the direction of q. 
Consequently 

U{q) ^ J J J U{r)e-""'''°'^'^'>d(fdcosieydr = 4tt U (r) smc{qr) dr (559) 

where the angular integration has been done using 

^ rg--»As-|i t\ _ -t\ 2sin(A) , , 

^ ' = 2sinc(A) (560) 



1 



iX X 



We can go on calculating the total cross section: 



1 fkE^^ 



(2^)2 \VE 

We note that by simple trigonometry 



'^total = / I ^mdn = I |C/(<7)|^27rsinMn (561) 







q = 2fc£; sin j dq = ks cos {^^"j dO^ ^ sui9nd0n = ^^ (562) 

Hence we can write the integral of the cross section as 



^total = ^ X '^^'^ (^^^^ 
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[21] Dynamics in the adiabatic picture 
== [21.1] The notion of adiabaticity 

Consider a particle in a one dimensional box with infinite walls. We now move the wall. What happens to the particle? 
Let us assume that the particle has been prepared in a certain level. It turns out that if the wall is displaced slowly, 
then the particle will stay in the same level. This is called the " adiabatic approximation" . We notice that staying in 
the same energy level means that the state of the particle changes! If the wall is moved very fast then the state of the 
particle does not have time to change. This is called the "sudden approximation". In the latter case the final state 
(after the displacement of the wall) is not an eigenstate of the (new) Hamiltonian. After a sudden displacement of 
the wall, the particle will have to "ergodize" it state inside the box. 

The fact that the energy of the particle decreases when we move the wall outwards, means that the particle is doing 
work. If the wall is displaced adiabatically and then displaced back to its original location, then there is no net work 
done. In such case we say that the process is reversible. But if the displacement of the wall is not slow then the 
particle makes transitions to other energy levels. The scattering to other energy levels is in general irreversible. 

In the problem that we have considered above, the parameter X that we change is the length L of the box. Therefore 
V = X is the velocity at which the wall (or the "piston") is displaced. In other problems X could be any field. An 
important example is a particle in a ring where X = $ is the magnetic flux through the ring, and EMF = —X is the 
electro motive force (by Faraday law). In problems of this type, the change in the parameter X can be very large, 
so we cannot use standard perturbation theory to analyze the evolution in time. Therefore, we would like to find 
another way to write Schrodinger's equation, so that X is the small parameter. 



=^^= [21.2] The Schrodinger equation in the adiabatic basis 

We assume that we have Hamiltonian ^{{Q, P; X) that depends on a parameter X. The adiabatic states are the 
eigenstates of the instantaneous Hamiltonian: 



n{X) \n(X)) = En(X) \n(X)) (564) 

It is natural in such problems to work with the adiabatic basis and not with a fixed basis. We will write the state of 
the system as: 



\^)=J2a^it)\n{Xm (565) 

n 

which means a„(<) = {ti{X {t))\'^ (t)) . If we prepare the particle in the energy level no and change X in an adiabatic 
way, then we expect |a„(i)p « Sn.no- * later stage we shall find a slowness condition for the validity of this 
approximation. But in general we have the Schrodinger's equation 

— = -iH{x,p-X{t))^ (566) 
from which we get: 

^ = + = -M'nij)+Y.^jn\m){m\^) = _ji?„a„ + X ^(^n|m)a,„ (567) 



Using the notation 



n|m) (568) 
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The Schrodingcr's equation takes the form 

= -iEnUn +iX'^ Anrndm (569) 
m 

For sake of analysis it is convenient to separate the diagonal part of the perturbation. So we define A„ = Ann- We 
pack the off diagonal part into a matrix which is defined as Wnm = ~^^nm for n ^ m and zero otherwise. With 
these notations the Schrodingcr's equation in the adiabatic representation takes the form 

= -i{En~ XAn)an- i^Wnmam (570) 

m 

It should be notices that the strength of the perturbation in this representation is determined by the rate X and not 
by the amplitude of the driving. 



[21.3] The calculation of A, 



Before we make further progress we would like do dwell on the calculation of the perturbation matrix Anm- First of 
all we notice that 

d d 

Anm = "^^^"'l™) = '("l^*^) (571) 

This is true because for any X 

(n|m) = 5nra (572) 

d 

{-^n\m.) + {n\—m) = 

In other words {-^nlm) is anti-Hcrmitian, and therefore — i(^?i|m) is Hcrmitian. We define more notations: 

An{X) = Ann - M-^u) (573) 



dU 
dX 



We want to prove that for n ^ m: 



(574) 



E -E 

This is a very practical formula. Its proof is as follows: 

{n\n\m) =0 for n 7^ m (575) 

d d d d d 

^ + (7i|^W|m) + {n\H\-^m) = E„,{—n\m) + K,„ + En{n\—m) = 



From the latter equality we get the required identity. 
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[21.4] The adiabatic approximation 



If X is small enough the perturbation matrix W will not be able to induce transitions between levels, and then we 
get the adiabatic approximation |a„(t)p ~ const. This means that the probability distribution docs not change with 
time. In particular, if the particle is prepared in level ri, then is stays in this level all the time. 

From the discussion of first-order perturbation theory we know that we can neglect the coupling between two different 
energy levels if the absolute value of the matrix element is smaller compared with the energy difference between the 
levels. Assuming that all the matrix elements are comparable the main danger to the adiabaticity are transitions to 
neighboring levels. Therefore the adiabatic condition is \W\ <C A or 

A2 

X < — (576) 
ri<7 

where a is the estimate for the matrix element Vnm that couples neighboring levels. 

An example is in order. Consider a particle in a box of length L. The wall is displaced at a velocity X. Given that the 
energy of the particle is E we recall that the energy level spacing is A = [i: / L)ve, while the coupling of neighboring 
levels, based on a formula that we have derived in a previous section, is 

It follows that the adiabatic condition is 

a: < (578) 
mi 

Note that the result does not depend on E. This is not the typical case. In typical cases the density of states increases 
with energy, and consequently it becomes more difficult to satisfy the adiabatic condition. 

Assuming we can ignore the coupling between different levels, the adiabatic equation becomes 

^ = -i{E,, - XA,,)a„ (579) 
And its solution is: 

an{t) = e-'/o(^"-^^")'^*'a„(0) (580) 

As already observed the probability |a„(t)p to be in a specific energy level does not change in time. That is the 
adiabatic approximation. But it is interesting to look also at the phase that the particle accumulates. Apart from 
the dynamical phase, the particle also accumulates a geometrical phase 

ft ft fX{t) 

phase = / {E^-XAr,)dt' = - / E^dt' + / A^{X')dX' (581) 
Jo Jo Jx{o) 

An interesting case is when we change more than one parameter. In this case, just as in the Aharonov-Bohm effect, 
the particle accumulates a "topological" phase that is called the "Berry phase". 

Berry phase = <j> An{X)dX (582) 



In fact, the Aharonov-Bohm effect can be viewed as a specific case of the topological effect that was explained above. 
In order to discuss further topological effects we have to generalize the derivation of the Adiabatic equation. This will 
be done in the next section. 
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[22] The Berry phase and adiabatic transport 

In the following lectures we discuss the adiabatic formalism and the linear-response (Kubo) theory adopting the 
presentation and the notations as in [http://arxiv.org/abs/cond-mat/0307619]. Note that unlike most textbooks 
we write the Schrodinger equation in the adiabatic representation without switching to the "interaction picture" . 

== [22.1] Definitions of A and B 



The adiabatic equation is obtained from the Schrodinger equation by expanding the wavefunction in the a;-dependent 
adiabatic basis: 

||^) = -\n{x{t)) IV-) (583) 



71 



m J 

where we define 

^imix) = ih(n{x)\-^m{x)^ (584) 

Differentiation by parts of dj {n{x)\m{x)) = leads to the conclusion that is a Hcrmitian matrix. Note that the 
effect of gauge transformation is 

\n{x)) ^ e-'^ \n{x)) (585) 

Note that the diagonal elements = are real, and transform as A:^ i-^ A:^ + 9jA„. 
Associated with An{x) is the gauge invariant 2-form, which is defined as: 

= d,Ai~d,A'^ = -2hlm{dkn\d,n) = -^Im^AL.^^„„ (586) 

m 

This can be written in abstract notation as _B = V A ^. 

Using standard manipulations, namely via differentiation by parts of dj {n{x)\Ti.\m(x)) = 0, we get for n =/= m the 
expressions: 



and hence 



on 



dxj 



(587) 



Tm 1 



The mathematical digression in the next section is useful in order to clarify the analogy between B and the notion of 
magnetic field. This leads to better understanding of topological effects that are associated with adiabatic processes. 
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22.2] Vector Analysis and "Geometrical Forms" 

Geometrical forms are the "vector analysis" generalization of the length, area and volume concepts to any dimension. 
In Euclidean geometry with three dimensions the basis for the usual vector space is ei, 62, 63. These are called 1-forms. 
We can also define a basis for surface elements: 

ei2 = ei A 62 (589) 

623 = 62 A 63 
631 = 63 A 61 

These are called 2-forms. We also have the volume element ei A 62 A 63 which is called 3-form. There is a natural 
duality between 2-forms and 1-forms, namely 612 ^— > 63 and 623 i— > ei and 631 i— > 62. Note that 621 — —612 i-^ —63, 
and 61 A 62 A 63 = —62 A ei A 63 etc. 

The duality between surface elements and 1-forms does not hold in Euclidean geometry of higher dimension. For 
example in 4 dimensions the surface elements (2-forms) constitute C4 = 6 dimensional space. In the latter case we 
have duality between the hyper-surface elements (3-forms) and the 1-forms, which are both 4 dimensional spaces. 
There is of course the simplest example of Euclidean geometry in 2 diemnsional space where 2-forms are regarded as 
either area or as volume and not as 1-form vectors. In general for N dimensional Euclidean geometry the k forms 
constitute a dimensional vector space, and they are dual to the {N — k) forms. 

We can take two 1-forms (vectors) so as to create a surface element: 

J2 ^»e, A J2 ^'^j = J2 ^^^J-e; A Cj = ^{A.Aj ~ AjA,)ei A Cj (590) 

hi i<j 

Note that A ii — Null. This leads to the practical formula for a wedge product 

{AAB),j^A,Bj-AjBi (591) 
We can also define the notation 

{d/\A),, =d,A, -djA, (592) 

Note that in 3 dimensional Euclidean geometry wc have the duality 

d AA^ V X A if A is a 1-forms (593) 
d AB ^ V ■ B if S is a 2-forms 

The above identifications are implied by the following: 

d = diei + die2 + 8163 (594) 

A = Alii +^262 + ^363 

B = B12 ■ (§12 + i?23 • 623 + -B31 • 631 

hence 

{d A A)i2 = diA2 - 92^1 = (V X ^)3 etc (595) 

and 



d A B ^ {diii + ^262 + 9363) A (-612612 4- -823623 + -831631) = (9i-B23 + 92-B31 + 93-612)6123 i-^ V • -B (596) 
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The generalized Stokes theorem relates the closed boundary integral over fc-form to an integral over (fc + 1) form 
within the interior. 

A-dl^ J j dAA-ds (597) 

In 3 dimensional Euclidean geometry this is the Stokes Integral Theorem if A is a 1-form, and the Divergence Integral 
Theorem if A is a 2-form. 

^= [22.3] The Berry phase 

We define the perturbation matrix as 

Wnm = - ^ ij^mri for 71 7^ 771 (598) 

] 

and = for n = m. Then the adiabatic equation can be re-written as follows: 

^ = --^(^n - iA„)a„ - W„™am (599) 

m 

If we neglect the perturbation W ^ then we get the strict adiabatic solution: 

\^[t)) = e-i(/o^"(-(*'))'^*'-r(o)^"(-)"^-) \n[x{t))) (600) 

The time dependence of this solution is exclusively via the x dependence of the basis states. On top, due to An{x) 
we have the so called geometric phase. This can be gauged away unless we consider a closed cycle. For a closed cycle, 
the gauge invariant phase § A - dx is called the Berry phase. 



With the above zero-order solution we can obtain the following result 



m 

dxk 



m ) - nix) \n{x)) (601) 



In the case of the standard examples that were mentioned previously this corresponds to a conservative force or to a 
persistent current. From now on we ignore this trivial contribution to {J'^)^ and look for the a first order contribution. 

= [22.4] Adiabatic Transport 



For linear driving (unlike the case of a cycle) the An [x) field can be gauged away. Assuming further that the adiabatic 
equation can be treated as parameter independent (that means disregarding the dependence of En and W on x) one 
realizes that the Schrodinger equation in the adiabatic basis possesses stationary solutions. To first order these are: 

Note that in a fixed-basis representation the above stationary solution is in fact time-dependent. Hence the explicit 
notations \n{x{t))) and \m{x(t))) are possibly more appropriate. 

With the above solution we can write {J-'^) as a sum of zero order and first order contributions. From now on we 
ignore the zero order contribution, but keep the first order contribution: 



m{^n) ri m \ / j \ m / j 



(603) 
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For a general stationary preparation, either pure or mixed, one obtains 

{T'^) = -Y^G'^^ i, (604) 

j 

with 

G'^^ = Y.f{E^)B^^ (605) 

n 

Where f{E„) are weighting factors, with the normalization J2nfi-^n) = 1- For a pure state preparation f{En) 
distinguishes only one state n, while for a canonical preparation /(-En) oc e"^"/"^, where T is the temperature. 
For a many-body system of non- interacting particles f{En) is re-interpreted as the occupation function, so that 
J2n fi^n) = N is the total number of particles. 

Thus we see that the assumption of a stationary first-order solution leads to a non-dissipative (antisymmetric) con- 
ductance matrix. This is known as either " adiabatic transport" or " geometric magnetism" . In the next sections we 
are going to see that " adiabatic transport" is in fact a special limit of Kubo formula. 

== [22.5] Beyond the strict adiabatic limit 

If the driving is not strictly adiabatic the validity of the stationary adiabatic solution becomes questionable. In general 
we have to take non- adiabatic transitions between levels into account. This leads to the Kubo formula for the response 
which we discuss in the next section. The Kubo formula has many type of derivations. One possibility is to use the 
same procedure as in the previous section starting with 

We shall not expand further on this way of derivation, which becomes quite subtle once we go beyond the stationary 
adiabatic approximation. The standard textbook derivation is presented in the next section. 



-iW„ 



(606) 
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[23] Linear response theory and the Kubo formula 
== [23.1] Linear Response Theory 



We assume that the Hamiltonian depends on several parameters, say three parameters: 

n = W(f,p; xi{t),X2{t),X3{t)) (607) 
and we define generahzed forces 

^F' ^ (608) 

OXk 

Linear response means that 

/CO 
a''^{t-t') Sxj{t')dt' (609) 

Where a*=^ (r) = for r < 0. The expression for the response Kernel is known as the Kubo formula: 

a'^ir) = eir)^{[:F\r),:F^m)o (610) 

Where the average is taken with the assumed zero order stationary solution. Before we present the standard derivation 
of this result we would like to illuminate the DC limit of this formula, and to further explain the adiabatic limit that 
was discussed in the previous section. 



== [23.2] Susceptibility and DC Conductance 

The Fourier transform of a'^^ (t) is the generalized susceptibility x*'"' (w). Hence 

K^'^)]. - ^xo''HN.-E/^''H[^^-]- (611) 

j 3 

where the dissipation coefficient is defined as 

/.(c.) = I^^ii^ = fa^^M^^dr (612) 

LU Jq LO 

In the "DC limit" (w 0) it is natural to define the generalized conductance matrix: 

G^^ = /^(oj-O) = lim ^ f a^3(^r)TdT (613) 

Consequently the non-conservative part of the response can be written as a generalized Ohm law. 

{T^) = -Y,G^= X, (614) 
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It is convenient to write the conductance matrix as 

where rj''^ — rj^^ is the symmetric part of the conductance matrix, while B^^ ~ ^B^^ is the antisymmetric part. 
In our case there are three parameters so we can arrange the elements of the antisymmetric part as a vector 
B = {B^^, B'^^, B^^). Consequently the generalized Ohm law can be written in abstract notation as 

{J") ^ -T]- X - B Ax (616) 

where the dot product should be interpreted as matrix-vector multiplication, which involves summation over the 
index j. The wedge-product can also be regarded as a matrix- vector multiplication. It reduces to the more familiar 
cross-product in the case we have been considering - 3 parameters. The dissipation, which is defined as the rate at 
which energy is absorbed into the system, is given by 

W = -{T)-x = ^if'^ XkXj (617) 

k] 

which is a generalization of Joule's law. Only the symmetric part contributes to the dissipation. The contribution of 
the antisymmetric part is identically zero. 

The conductance matrix is essentially a synonym for the term "dissipation coefficient". However, "conductance" is a 
better (less misleading) terminology: it does not have the (wrong) connotation of being specifically associated with 
dissipation, and consequently it is less confusing to say that it contains a non-dissipative component. We summarize 
the various definitions by the following diagram: 




^k] Qkj 

(dissipative) (non-dissipative) 



[23.3] Derivation of the Kubo formula 



A one line derivation of the Kubo formula is based on the interaction picture, and is presented in another section of 
these lecture notes. There are various different looking derivations of the Kubo formula that highlight the quantum- 
to-classical correspondence and/or the limitations of this formula. The advantage of the derivation below is that it 
also allows some extensions within the framework of a master equation approach that takes the environment into 
account. For notational simplicity we write the Hamiltonian as 



n = -Ho -f{t)v 



(618) 
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and use units with h= 1. We assume that the system, in the absence of driving, is prepared in a stationary state po- 
In the presence of driving we look for a first order solution p{t) = po + p{t)- The equation for p{t) is: 



^ « -z[Ho,/5(t)]+z/(i)[l/,po] (619) 

Next we use the substitution p{t) = Un{t)p{t)Uo{t)^^ , where Uo{t) is the evolution operator which is generated by 
Tio- Thus we eliminate from the equation the zero order term: 



^ « if{t)[Uo{t)-'VU„{t),po] (620) 
The solution of the latter equation is straightforward and leads to 

p{t) ^ P0+ f i[V{-{t-t')),po] f{t')dt' (621) 



where we use the common notation V{t) — Uo{t) ^VUo{t). 

Consider now the time dependence of the expectation value {J-)t = trace(jFp(t)) of an observable. Disregarding the 
zero order contribution, the first order expression is 



{T)t ^ f\trBce{T[V{-{t-t')),po]) f{t')dt' = ( a{t - 1') f{t')dt' 



where the response kernel a(T) is defined for t > as 



a{T)^itr^ce{T[V{-T),po]) = i tr^ce{[T ,V{-t)]p^) = ,V{-t)]) =i{[T{t),V]) (622) 

Where we have used the cyclic property of the trace operation; the stationarity UoPqUq^ = po of the unperturbed 
state; and the notation J-{t) ~ Ua{T)~^ TUq{t) . 
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[24] The Born-Oppenheimer picture 

We now consider a more complicated problem, where x becomes a dynamical variable. The standard basis for the 
representation of the composite system is \x, Q) = \x) ® \Q). We assume a total Hamiltonian of the form 

= + (623) 

zm ^ — ' 

3 

Rather than using the standard basis, we can use the Born-Oppenheimer basis \x^n{x)) = \x) ® \n{x)). Accordingly 
the state of the combined system is represented by the wavefunction \E'„(a;), namely 

I*) = IZ*"(^) (624) 
The matrix elements of TL are 



{x,n{x)\H\xQ,m{xQ)) = (S (a; - Xq) x (x) (625) 

The matrix elements of ^(Q) are 

{x,n{x)\V{Q)\x(i,m{xa)) = 5{x - x^) y.Vnni{x) (626) 



The matrix elements of p are 



{x,n{x)\pj\xo,m{xo)) = {~idjS{x~xo)) x {n{x)\m{xo)) 
The latter can be manipulated "by parts" leading to 

{x,n{x)\pj\xo,m{xQ)) -idjSix - XQ)Snm - S{x - xo)Al^^{x) (627) 



This can be summarized by saying that the operation of pj on a wavefunction is like the differential operator 
—idj — A:^,„(x). Thus in the Born-Oppenheimer basis the total Hamiltonian takes the form 

= ^Y.'^P,-Ai„Mf+SnMx)~f{t)Vn,n{x) (628) 
j 

Assuming that the system is prepared in energy level n, and disregarding the effect of A and V, the adiabatic motion of 
X is determined by the effective potential En{x). This is the standard approximation in studies of diatomic molecules, 
where x is the distance between the nuclei. If we treat the slow motion as classical, then the interaction with A can 
be written as 

hL\L.o„ = -5]i,A^„,(x) (629) 

This brings us back to the theory of driven systems as discussed in previous sections. The other interaction that can 
induce transitions between levels is 

hL'L.o„ = -fmnM (630) 



The analysis of molecular " wavepacket dynamics" is based on this picture. 



The Green function approach 
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[25] The propagator and Feynman path integral 
^^^^= [25.1] The propgator 



The evolution of a quantum mechanical system is described by a unitary operator 

= C/(t,to) |*(to)) (631) 

The Hamiltonian is defined by writing the infinitesimal evolution as 

U{t + dt,t) ^ I -idtnit) (632) 

This expression has an imaginary i in order to make Ti. a Hermitian matrix. If we want to describe continuous 
evolution we can "break" the time interval into N infinitesimal steps: 

U(t,to) ^ {1 - idtNH) . . . (1 - idt2n)il - idtiH) = Te'^-'^'o (633) 

For a time independent Hamiltonian we get simply U{t) = e~'*^ because of the identity e'^e'^ = c^^^ if [A, B] = 0. 

If we consider a particle and use the standard position representation then the unitary operator is represented by a 
matrix U{x\xo). The time interval [to,t\ is implicit. We alway assume that t > to- Later it would be convenient to 
define the propagator as U{x\xo) for t > to and as zero otherwise. The reason for this convention is related to the 
formalism that we are going to introduce later on. 

== [25.2] The Propagator for a free particle 



Consider a free particle in one dimension. Let us find the propagator using a direct calculation. The hamiltonian is: 

H=|- (634) 
Zm 

As long as the Hamiltonian has a quadratic form, the answer will be a Gaussian kernel: 

/ m \ 2 j^f^^_^^-^2 



Uix\xo) = [x e-*-P xo) = [—) e^-t--o) (635) 

We note that in the case of a harmonic oscillator 
1 

n = — + -mn^x"^ (636) 
2m 2 ^ ^ 

The propagator is 

U{x\xo) = ( "'^^ ) ' e'2iT^l<^°-^t(-'+-l)-^--o] (637) 
^ ' ' \27rismntJ ^ ' 

If we take t then U I, and therefore U{x\xo) — > S{x ~ xq). 
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The derivation of the expression for the propagator in the case of a free particle goes as follows. For clarity we use 
units such that m = 1: 



(x|e-^^*P \xo) 



^y{x\k)e-'^"'"{k\xo) ^ I — e-'^*'='+ 
u J 27r 



ik{x—XQ ) 



(638) 



This is formally an inverse FT of a Gaussian. One obtains the desired result, which is formally a Gaussian that has 
an imaginary variance cr^ = it. We note that we can write the result in the form 



(x|e-^^*p'|xo) 



1 .(x-xq)^ 



1 



zC 2t 



cos ^(a- - xaY + isin^(a; - I'o)^ 



(639) 



If t ^ we should get a delta function. This is implied by the FT, but it would be nice to verify this statement 
directly. Namely we have to show that in this limit we get a narrow function whose " area" is unity. For this purpose 
we use identity 



cos r . . . dr 



cos It 



. du 



(640) 



and a similar expression in case of sin function. Then we recall the elementary integrals 



smit / cosu /TT 

— r^du = / — r^du = \ — 
^ Jo V 2 



(641) 



Thus the "area" of the two terms in the square brackets is proportional to (1 + i)/V2 which cancels the \/I of the 
prcfactor. 



25.3] Feynman Path Integrals 



How can we find the propagator U{x\xo) for the general Hamiltonian H = ^ + V{x)7 The idea is to write (x|e **^|a;o) 
as a convolution of small time steps: 



(x|e-^^*««|xjv^i)...(x2|e-'^*^«|xi)(xi|e-*^*^^|xo) 



(642) 



Now we have to find the propagator for each infinitesimal step. At first sight it looks as if we just complicated the 
calculation. But then we recall that for infinitesimal operations we have: 



for any A and B 



(643) 



This is because the higher order correction can be made as small as we want. So we write 

{xj\e 
and get: 



« (x,|e-^^*^(-))e-^^*fe|a;,_i) « (_^)ie'5%(-.— 



(644) 



U{x\xo) = J dxidx2...dxN-i{^^)'''e'^^''^ = J d[x] 



JA\x] 



(645) 
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where A\x] is called the action. 



N-l ,. 

A[x\ = [^(-^J - ^J-i)^ - dtVix) = / {-mx^ - V{x))dt = / C{x,x)dt 



(646) 



More generally, if we include the vector potential in the Hamiltonian, then we get the Lagrangian 
£(x,i) = ^mx^ ~ V [x) A{x)x 



(647) 



leading to: 



-^N = / ^2 ^ V{x))dt + / A{x) ■ dx 



(648) 



== [25.4] Stationary Point Approximation 

The Stationary Point Approximation is an estimate for an integral of the form J e^-^^^^dx. The main contribution to 
the integral comes from the point x = x, called a stationary point, where A'{x) = 0. We expand the function A{x) 
near the stationary point: 



A{x) 



Aix) + -A"ix){x-x)^ 



(649) 



Leading to 



/ i2TT 



(650) 



Where the exponential term is a leading order term, and the prefactor is an " algebraic decoration" . 

The generalization of this method to multi-dimensional integration over d[x] is immediate. The stationary point is 
in fact the trajectory for which the first order variation is zero {SA ~ 0). This leads to Lagrange equation, which 
implies that the "stationary point" is in fact the classical trajectory. Consequently we get the so called semiclassical 
(Van-Vleck) approximation: 



U{x\xo) 



d[x] e 



iA[x 



y.(—) 



d/2 



det 



d^Ac 



dxdxo J 



1/2 



iAci{x,xo)-i{-rr/2)i^cl 



(651) 



Where d is the number of degrees of freedom, and 



Aci{x,xo) = A[xci] 



(652) 



is the action of the classical trajectory as a function of the two end points. In d dimensions the associated determinant 
is dx d. The Morse-Maslov index I'ci counts the number of conjugate points along the classical trajectory. If the time 
is short it equals zero and then both this index and the absolute value that enclosed the determinant can be omitted. 
Otherwise the recipe for the determination of the Morse index is as follows: The linearized equation of motion for 

x{t) = Xci{t) + Sx{t) is 



mSx + V"{xci)6x = 



(653) 
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A conjugate point (in time) is defined as the time when the hnearized equation has a non-trivial solution. With 
this rule we expect that in case of a reflection from a wall, the Morse index is +1 for each collision. This is correct 
for "soft" wall. In case of "hard" wall the standard semiclassical result for i/ci breaks down, and the correct result 
turns out to be +2 for each collision. The latter rule is implied by the Dirichlet boundary conditions. Note, that 
it would be +0 in the case of Neumann boundary conditions. The Van-Vlcck semiclassical approximation is exact 
for quadratic Hamiltonians because the " stationary phase integral" is exact if there are no higher order terms in the 
Taylor expansion. 

Let us compute again U{x\xo) for a free particle, this time using the Van-Vleck expression: The action A[x] for the 
free particle is 




(654) 



Given the end points we find the classical path 



Xcl = Xq + 



i 



(655) 



and hence 




(656) 



We also observe that 




m 



t 



(657) 



which leads to the exact result. The big advantage of this procedure emerges clearly once we try to derive the more 
general expression in case of a harmonic oscillator. 
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[26] The resolvent and the Green function 



[26.1] The resolvent 



The resolvent is defined in the complex plane as 

G(z) = ^— (658) 
z — 7t 

In case of a bounded system it has poles at the eigenvalues. We postpone for later the discussion of unbounded 
systems. It is possibly more illuminating to look on the matrix elements of the resolvent 

where il}^{x) = {x\n) are the eigenstates of the Hamiltonian. If we fix x and xq and regard this expression as a function 
of z this is formally the complex representation of an electric field in a two dimensional electrostatic problem. 

We can look on G(x|a;o), with fixed z = E and Xg, as a wavefunction in the variable x. We see that G{x\xq) is a 
superposition of eigenstates. If we operate on it with {E — Ti.) the coefficients of this superposition are multiplied 
by {E — Ek), and because of the completeness of the basis wc get S{x — xq). This means that G{x\xo) satisfies the 
Schrodinger equation with the complex energy E and with an added source at x = xo- Namely, 

{E~n)G{x\xo) ^S{x-xq) (660) 

This is simply the standard representation of the equation (z — Ti.)G ~ 1 which defines the matrix inversion 
G~ l/(z — 7i). The wavefunction G{x\xo) should satisfy that appropriate boundary conditions. If we deal with 
a particle in a box this means Dirichlet boundary conditions. In the case of an unbounded system the issue of 
boundary conditions deserves further discussion (see later). 

The importance of the Green functions comes from its Fourier transform relation to the propagator. Namely, 

Ft\ e{t)e-^* U{t)\ = iG{uj + ij) (661) 

Where Q{t) is the step function, ld{t)U{t) is the "propagator", and e~''* is an envelope function that guarantees 
convergence of the FT integral. Later we discuss the limit 77 ^ 0. 

We note that we can extract from the resolvent useful information. For example, wc can get the energy eigenfunction 
by calculation the residues of G{z). The Green functions, which we discuss in the next section are obtained (defined) 
as follows: 

G^iiv) ^ G{z^u;±iO) = 1— - (662) 

uj — H±iO 

From this definition follows that 

Ini[G+] EEE -^(G+ - G") = -TTd{E - n) (663) 

From here we get expression for the density of states g{E) and for the local density of states p{E), where the latter 
is with respect to an arbitrary reference state \1/ 

g{E) = -itrace(^ lm[G+{E)] ^ (664) 



p{E) = -:^(* lm[G+{E)] 



Ill 



Further applications of the Green functions will be discussed later on. 

Concluding this section we note that from the above it should become clear that there are three methods of calculating 
the matrix elements of the resolvent: 

• Summing as expansion in the energy basis 

• Solving an equation (Helmholtz for a free particle) 

• Finding the Fourier transform of the propagator 

Possibly the second method is the simplest, while the third one is useful in semiclassical schemes. 



[26.2] Mathematical digression 



Let us summarize some useful mathematical facts that are related to the theory of the resolvent. First of all we have 
the following identity for a decomposition into "principal" and "singular" parts: 

^ . 1 ^^^^^^ ^ggg^ 



where is an infinitesimal. An optional proof of this identity in the context of contour integration is based on 
deforming the contour into a semicircle in the upper complex half plane, and then displacing the pole to be on the real 
axis. Then the first term is contributed by the principal part, while the second term is contributed by the semicircle. 

The following Fourier transform relation is extremely useful, and we would like to establish it both in the forward 
and in the backward directions: 

FT[e(t)] = ^— (666) 

U! + IV 

An equivalent relation is 



FT 



m - \ 



[the FT that gives the principal part] (667) 
In the forward direction the proof is based on taking the 7^0 limit of 



oo 



FT [e(i)c"''*] = / e-('^-*")*dt = (668) 

Jo 7 - 



Note that this is not the same as 
27 



FT 



e-7l*l 



[the FT that gives the singular part] (669) 







/-co 2^ 


u + iO 



In the backward direction the simplest is to use contour integration: 

= 27ri X Residue [for t > the contour is closed in the lower half plane] (670) 

The other possibility is to stay with the original contour along the real axis, but to split the integrand into principal 
part and singular part. The integral over the principal part is in-fact the inverse Fourier transform of i/uj. Its 
calculation is based on the elementary integral 

f^°° sin(aa;) , / n , , , > 

/ — i — -dx = Sign(a)7r = ±tt (671) 
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[26.3] Analytic continuation 



The resolvent is well defined for any z away from the real axis. We define G^{z) = G{z) in the upper half of the 
complex plane. As long as we discuss bounded systems this "definition" looks like a duplication. The mathematics 
becomes more interesting once we consider unbounded systems with a continuous energy spectrum. In the latter case 
there are circumstances that allow analytic continuation of G^{z) into the lower half of the complex plane. This 
analytical continuation, if exists, would not coincide with G~{z). 

In order to make the discussion of analytical continuation transparent let us assume, without loss of generality, that 
we are interested in the following object: 



The function f{z) with z = x + iy can be regarded as describing the electric field in a two dimensional electrostatic 
problem. The field is created by charges that are placed along the real axis. As the system grows larger and larger 
the charges become more and more dense, and therefore in the "far field" the discrete sum can be replaced by 
an integral / g{E)dE where g{E) is the smoothed density of states. By "far field" we mean that Im[z] is much larger 
compared with the mean level spacing and therefore we cannot resolve the finite distance between the charges. In the 
limit of an infinite system this becomes exact for any finite (non-zero) distance from the real axis. 

In order to motivate the discussion of analytical continuation let us consider a typical problem. Consider a system 
built of two weakly coupled ID regions. One is a small "box" and the other is a very large "surrounding" . The barrier 
between the two regions is a large delta function. According to perturbation theory the zero order states of the 
" surrounding" are mixed with the zero order bound states of the " box" . The mixing is strong if the energy difference 
of the zero order states is small. Thus we have mixing mainly in the vicinity of the energies Ej. where we formerly had 
bound states of the isolated " box" . Let us assume that describes the initial preparation of the particle inside the 
"box". Consequently we have large g„ only for states with £"„ w Er- This means that we have an increased "charge 
density" in the vicinity of energies Er- It is the LDOS rather than the DOS which is responsible for this increased 
charge density. Now we want to calculate f{z). What would be the implication of the increased charge density on 
the calculation? 

In order to understand the implication of the increased charge density on the calculation we recall a familiar problem 
from electrostatics. Assume that we have a conducting metal plate and a positive electric charge. Obviously there 
will be an induced negative charge distribution on the metal plate. We can follow the electric field lines through the 
plate, from above the plate to the other side. We realize that we can replace all the charge distribution on the plate 
by a single negative electric charge (this is the so called "image charge"). 




(672) 



n 




O 
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Returning to the resolvent, we realize that we can represent the effect of the increased "charge density" using an 
"image charge" which we call a "resonance pole". The location of the "resonance pole" is written as E^. — i(rr/2). 
Formally we say that the resonance poles are obtained by the analytic continuation of G{z) from the upper half plane 
into the lower half plane. In practice we can find these poles by looking for complex energies for which the Schrodinger 
equation has solutions with "outgoing" boundary conditions. In another section we give an explicit solution for the 
above problem. Assume that this way or another we find an approximation for f{z) using such "image charges": 

f{E) = mG+{E)\^) = TiT = E F (p^^ur I0^\ + background (673) 

We observe that the sum over n is in fact an integral, while the sum over r is a discrete sum. So the analytic 
continuation provides a simple expression for G{z). Now we can use this expression in order to deduce physical 
information. We immediately find the LDOS can be written as as sum over Lorentzians. The Fourier transform of 
the LDOS is the survival amplitude, which comes out a sum over exponentials. In fact we can get the result for the 
survival amplitude directly by recalling that Q{t)U{t) is the FT of iG'^{uj). Hence 

{x\U{t)\xQ) = ^ Q^e~*^'-*-(^'-/2)* + short time corrections (674) 

r 

If involves contribution from only one resonance, then the probability to stay inside the "box" decreases exponen- 
tially with time. This is the type of result that we would expect form either Wigner theory or from the Fermi Golden 
rule. Indeed we are going later to develop a perturbation theory for the resolvent, and to show that the expression 
for F in leading order is as expected. 

== [26.4] The Green function of a bounded particle 



In order to get insight into the mathematics of G^{z) wo first consider how G{z) looks like for a particle in a very 
large box. To be more specific we can consider a particle in a potential well or on a ring. In the latter case it means 
periodic boundary conditions rather than Dirichlet boundary conditions. Later we would like to take the length L of 
the box to be infinite so as to have a "free particle". Expanding V(a;) = (a;|G(z)|a;o) in the energy basis we get the 
following expressions: 

(x|G..,(z)|xo) = 2y^ ^i^(fc»-^)^yfc»^o) ^^^^^ 

ij ^ Z — _C/7) 

n " 
1 „ikn{x—xo) 

(x|G_(^)ko) = jY. W- 

where the real /c„ correspond to a box of length L. As discussed in the previous lecture, this sum can be visualized 
as the field which is created by a string of charges along the real axis. If we are far enough from the real axis we get 
a field which is the same as that of a smooth distribution of " charge" . Let us call it the " far field" region. As we 
take the volume of the box to infinity the " near field" region, whose width is determined by the level spacing, shrinks 
and disappears. Then wc arc left with the "far field" which is the resolvent of a free particle. The result should not 
depend on whether we consider Dirichlet of periodic boundary conditions. 

The summation of the above sums is technically too difficult. In order to get an explicit expression for the resolvent 
we recall that ip{x) = {x\G{z)\xo) is the solution of a Schrodinger equation with complex energy z and a source at 
X ^ xq. The solution of this equation is 

(x\G(z)\xo) = -i-e''=l^-^»l + Ae''=^ + Be'''''' (676) 
k 

where k = (2mz)^/^ corresponds to the complex energy z. The first term satisfies the matching condition at the 
source, while the other two terms are "free waves" that solve the associated homogeneous equation. The coefficients 
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A and B should be adjusted such that the boundary conditions are satisfied. For the "well" we should ensure the 
Dirichlet boundary conditions ip^x) = for x = 0,L, while for the "ring" we should ensure the periodic boundary 
conditions V'(O) = ip{L) and ^-'(0) = ip'{L). 

Let us try to gain some insight for the solution. If z is in the upper half plane then we can write k ^ + ia where 
both kE and a are positive(!) real numbers. This means that a propagating wave (either right going or left going) 
exponentially decays to zero in the propagation direction, and exponentially explodes in the opposite direction. It is 
not difficult to conclude that in the limit of a very large L the coefRcients A and B become exponentially small. In 
the strict L — > oo limit we may say that ip{x) should satisfy "outgoing boundary conditions". If we want to make 
analytical continuation of G{z) to the lower half plane, we should stick to these "outgoing boundary conditions". The 
implication is that ijj{x) in the lower half plane exponentially explodes at infinity. 

An optional argument that establishes the application of the outgoing boundary conditions is based on the observation 
that the FT of the retarded G~^{lu) gives the propagator. The propagator is identically zero for negative times. If we 
use the propagator to propagate a wavepacket. we should get a non-zero result for positive times and a zero result for 
negative times. In the case of an unbounded particle only outgoing waves are consistent with this description. 



^= [26.5] Solving the Helmholtz equation in 1D/2D/3D 

The simplest method to find the Green function is by solving the Schrodinger equation with a source and appropriate 
boundary conditions. In the case of a free particle we get the Helmholtz equation which is a generalization of the 
Poisson equation of electrostatic problems: 

{W^ + kE^)G{r\ro) = -qS{r - To) (677) 

where the "charge" in our case is q = —2m/h^. For fc^; = this is the Poisson equation and the solution is the 
Coulomb law. For ks the solution is a variation on Coulomb law. We shall explore below the results in case of a 
particle in 3D. and then also for ID and 2D. 



The 3D case: 

In the 3D case the "Coulomb law" is: 

G(r|ro) = — rcos(fc£|r-ro|) (678) 

47r|r - rol 

This solution still has a gauge freedom, just as in electrostatics where we have a "free constant". We can add to this 
solution any " constant" , which in our case means an arbitrary (so called " free wave" ) solution of the homogeneous 
equation. Note that any "free wave" can be constructed from a superpostion of planar waves. In particular the 
"spherical" free wave is obtained by averaging e''' '' over all directions. If we want to satisfy the "outgoing wave" 
boundary conditions we get: 

G{r) ^ cos(fcEr) + i-^ sin(fc£;r) = -^c*'^'^'^ = — ^e*'=^'^ (679) 
47rr 47rr Airr 27rr 

The solutions for ID and 2D can be derived in the same way. 



The ID case: 

In the one dimensional case the equation that determines the Green function is 

+ 4) G(x) = ^qSix) (680) 

where for simplicity we set xq = 0. The delta function source requires a jump of the derivative G'{+0) — G'(— 0) = —q. 
Using different phrasing: in order for the second derivative to be a delta function the first derivative must have a step 
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function. Thus, when fc^; = we get the ID Coulomb law G{x) = —{q/2)\x\, but for fc^ 7^ we have a variation on 
Coulomb law 

G{x) = -^sm{kE\x\) (681) 

(see figure). To this we can add any ID free wave. In order to satisfy the "outgoing waves" boundary conditions we 
add cos(fca::) = cos(fc|x|) to this expression, hence we get the retarded Green's function in ID 

G{x) = i-^e*'-'^!"^! = -i^c''"^\^\ (682) 
2kE kE 




The 2D case: 

In two dimensions for fc^ = we use Gauss' law to calculate the electrostatic field that goes like 1/r, and hence the 
electrostatic potential is G{r) ~ — (l/(27r)) Inr. for fc^; 7^ wc get the modulated result 

G{r) = -|Yo(fcBr) (683) 

where Yo(x) is the Bessel function of the second kind 

Yo'(x) = -Yi(x) (684) 

/ 2 TT 

Yo(x) ~ \ — sin(a; ) , for large a; 

\ TTX 4 

The second independent solution of the associated homogeneous equation is the Bessel function of the first kind Jq {x) . 
Adding this "free wave" solution leads to a solution that satisfies the "outgoing wave" boundary conditions. Hence 
we get the retarded Green's function in 2D 

G{r) = dYLo{kEr) = -i^Ho(fc£r) (685) 

where Ho(a;) = Jo(.t) + iYo(a;) is the Hankcl function. 



== [26.6] The Green function via direct diagonalization 

For a free particle the eigenstates are known and we can calculate the expression by inserting a complete set and 
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integrating. From now on wc set m = 1 in calculations, but we restore it in the final result. 
G+{x\x^) = Y,{x\k)—^——{k\, 



(686) 



where d is the dimension of the space. In order to compute this expression we define f ~ x — xo and choose our 
coordinate system in such a way that the z direction will coincide with the direction of r. 



G+{x\xo) - 



2e 



ikr cos 9 



dVLk'^-^dk 



fc2 + iO {2t:Y 



(687) 



where ks ~ \ 2mE is the wavenumber for a particle with energy E. The integral is a d-dimensional spherical integral. 
The solutions of |k| = fc in ID give two fc's, while in 2D and 3D the fc's lie on a circle and on a sphere respectively. 
We recall that is 2,27r,47r in ID. 2D and 3D respectively, and define averaging over all directions for a function 
f{9) as follows: 



Umd = ^ / f{e)dn 



(688) 



With this definition we get 





cos(fcr), 


d=l 


Akr cos 6\ 

e )d = < 


Jo(fcr), 


d=2 




sinc(fcr) 


d==3 



(689) 



where Jo(a;) is the zero order Bessel function of the first kind. Substituting these results and using the notation z = kr 
we get in the 3D case: 



G+(r) 



z sm z 



1 1 



TT^r 2./-00 ~ 



1 1 

TT^r 4i 

— E 

27rr ^ 



-dz 



1 1 



z(e*' 



'-) 



iO TT^r Ai 7_oo ^e'^ — 2^ + lO 
ze*^ 

-dz 



dz 



(690) 



{z-{zE + iQ)){z+{zE+iO)) 



[z - {ZE + iQ)){z + {ZE + iO)) 



dz 



Res[/(z)] 



1 

2'nr 



-c e 

2 2 



-i(-ZE) 



27r 




-(E+iO) 



117 



The contour of integration (see figure) has been closed in the upper half of the plane for the term containing e'^, 
and in the lower half for the term with e~'^, so the arc at infinity add zero contribution. Then the contour has been 
deformed, so the contribution for each term comes from one of the poles at ±{ze + iO). We see that the solution is a 
variation on the familiar Coulomb law. 



== [26.7] Green function for a quasi ID network 

There is a very simple expression for the Green function of a particle that has energy E in a quasi ID network (see 
"QM is practice" section): 

G(x\xo) = -i—y"AJE)c''"'^'- (691) 



where p labels trajectories that start at xq and end at x. The length of a given trajectory is Lp, while A^(E) is 
the product of transmission and reflection amplitudes due to scattering at the nodes of the network. The proof 
of this formula is a straightforward generalization of the standard ID case: The function "^[x) = G(a;|a;o) satisfies 
the Schrodinger equation for any x xq, and in particular the matching conditions at the nodes are satisfied by 
construction. At xq the function ^{x) has the same discontinuity as in the standard ID case, because for the direct 
trajectory Lp — \x — xq\ has a jump in its derivative. This discontinuity reflects having a source at x = xq as required. 

Possibly the simplest quasi ID networks, is a line with one node. We can describe the node as a delta function 
potential: 



V{x) = u6{x) = \0)u{0\ (692) 

In the last equality we point out that V{x) is essentially a projector For this type of potential there is a 

simple inversion formula that we discuss below, that allows a direct evaluation of the resolvent. Optionally one can 
calculate the reflection and the transmission coefficients for a delta function potential, and use the general formula of 
the previous paragraph. 

The following inversion formula can be proved simply by selecting a basis that contains the ip as one of its states. In 
such basis the matrix in the denominator is diagjl— A, 1, 1, 1, ...}, 

^ ^ ^ IV')A(VI (693) 



1_|^)A(^| l-A 
A generalization of this formula for general invertible A, and B = |?/')A('0| is 



1 



A-B Ja 



1 - -^B^ 
VA 



-1 



1 I / 1 \ 1 „ 1 



a \1 -trace{A-^B) J a A 



-B- (694) 



We can use this formula in order to find the resolvent in the case of a delta function: 

= 71? = 7^)'^oVGo ^ Go{x\xo) + rGo{x\0)Go{0\xo) (695) 

where 

g{E) ^ Go(0|0) = -— (696) 

r = reflection coefficient = — ^("^/^e) (697) 

1 + i{u/ve) ^ ' 
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For Go(x|a:o) wc use the explicit result that has been derived for a free particle. By inspection we deduce that the 
transmission coefficient is t = 1 + r. 



== [26.8] Semiclassical evaluation of the Green function 

The semiclassical procedure, based on the Van-Vleck formula, provides a common approximation for all the special 
cases that have been discussed above, and also allows further generalization. The Green function is the FT of the 
propagator: 

j'OO 

G{X\X0) - / e'-^cl(^,=^0)+*£;t^i _ g*5e,(x,Xo) (ggg) 







In order to derive the last expression one applies the stationary phase approximation for the evaluation of the dt 
integral. The stationary point is determined by the equation E^i = E where E^i = —dA^i/dt. The reduced action is 
defined as 



ft rx{t) 

5,1 (x, xo) = Ai + -B = / [£,1 + E]dt= i p,, ■ dx (699) 

Jo Jx{Q) 



It is implicit in this definition that the reduced action is evaluated for a classical trajectory that has energy E and 
connects the points a;(0) and x(t), and likewise the classical momentum p^i is evaluated along this trajectory. 

== [26.9] The boundary integral method 

The Schrodinger equation Tiip — Eipo can be written as Ti.E'4' = where 

nE = -y^ + UEir) (700) 

with UE{r) = U{r) — E. Green's function solves the equation TiEG{r\rQ) = —q5{r — tq) with q = — 2m/fi,^. In this 
section wc follow the convention of electrostatics and set q = 1. From this point on we use the following (generalized) 
terminology: 

Laplace equation [no source]: TCEip{r) = (701) 

Poisson equation: 'Hs'tpir) = p{r) (702) 

Definition of the Coulomb kernel: 7i£;G(r|ro) = 5{r — ro) (703) 

The solution of the Poisson equation is unique up to an arbitrary solution of the associated Laplace equation. If the 
charge density p{r) is real, then the imaginary part of '0('") is a solution of the Laplace equation. Therefore without 
loss of generality we can assume that ^{r) and G'(r|ro) are real. From the definition of the Coulomb kernel it follows 
that the solution of the Poisson equation is 

Coulomb law: ^(r) = / G{r\r')p{r')dr' (704) 



In particular we write the solution which is obtained if we have a charged closed boundary r = r{s). Assuming that 
the charge density on the boundary is a(s) and that the dipole density (see discussion below) is d{s) we get: 

^{r) = I ([G{r\s)]a{s) + [dsG{r\s)]dis)ys (705) 



We use here the obvious notation G{r\s) = G{r\r{s)). The normal derivative 9 = n • V is taken with respect to the 
source coordinate, where n is a unit vector that points outwards. 
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It should be obvious that the obtained V'(^) solves the Laplace equation in the interior region, as well as in the exterior 
region, while across the boundary it satisfies the matching conditions 

Gauss law: ai/;(s+) - a?/'(s") = -cr(s) (706) 

^{s+)-ip{s-) = d{s) (707) 

These matching conditions can be regarded as variations of Gauss law. They are obtained by integrating the Poisson 
equation over an infinitesimal range across the boundary. The charge density cr[s) implies a jump in the electric field 
—dip{s). The dipole density d{s) is formally like a very thin parallel plates capacitor, and it implies a jump in the 
potential ?/'(s). 

Let us ask the inverse question: Given a solution of Laplace equation in the interior region, can we find a{s) and d{s) 
that generate it? The answer is yes. In fact, as implied from the discussion below, there are infinitely many possible 
choices. But in particular there is one unique choice that gives ip{r) inside and zero outside. Namely, a{s) = d'ijj{s) 
and d{s) — ^'ip{s), where ip{s) = ip{r{s)). Thus we get: 

The bounday integral formula: ^'(0 = <j> ([G{r\s)]d'il}{s) - [dsG{r\s)]'4j{s)Jds (708) 

The bounday integral formula allows to express the ipir) at an arbitrary point inside the domain using a boundary 
integral over ^p{s) and its normal derivative dip{s). 

The standard derivation of the boundary integral formula is based on formal algebraic manipulations with Green's 
theorem. We prefer below a simpler physics-oriented argumentation. If ')/'(''') satisfies Laplace equation in the interior, 
and it is defined to be zero in the exterior, then it satisfies (trivially) the Laplace equation also in the exterior. On 
top it satisfies the Gauss matching conditions with a{s) = dilj{s~) and d{s) = —4'{s~). Accordingly it is a solution of 
the Poisson equation with cr[s) and d{s) as sources. But for the same cr(s) and d{s) we can optionally obtain another 
solutions of the Poisson equation from the bounday integral formula. The two solutions can differ by a solution of 
the associated Laplace equation. If we supplement the problem with zero boundary conditions at infinity, the two 
solutions have to coincide. 

For the case where the wave function vanishes on the boundary ip{-^) = the expression becomes very simple 

i!{r)= <j,G{r\s')ip{s')ds' (709) 
where (p(s) = d'ip{s), and in particular as we approach the boundary we should get: 

Gis\s')Lp{s')ds' = (710) 



An obvious application of this formula leads to a powerful numerical method for finding eigenfunctions. This is the 
so called boundary integral method. Let us consider the problem of a particle in a billiard potential. Our Green's 
function is (up to a constant) 

G(s|s')-Yo(fcs|r(s)-r(s')|) (711) 

If we divide the boundary line into N segments then for any point on the boundary the equality J G{s\s')ip(s')ds' = 
should hold, so: 

^ A.jifij = with Ay = YoikE\r{s,) - r(s,)|) (712) 



Every time the determinant det(j4) vanishes we get a non trivial solution to the equation, and hence we can construct 
an eigenfunction. So all we have to do is plot the determinant det(j4) as a function of fc^. The points where dct{A) 
equals zero are the values of fc^ for which the energy E is an eigenvalue of the Hamiltonian Ti.. 
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[27] Perturbation theory 



[27.1] Perturbation theory for the resolvent 



If A and B are operators then 

oo 

--— = (713) 



n=0 



— ^ = (A(l - A-^B))-^ = (1 - A-^B)-^A-^ = VM-1b)"A-i = 4 + -B^ + ^B^B^ + ... 
A-B ^ ^ )i \ I l^"^ I A A A A A A 

71 

Consider 

^^'^ " = z - (Ho + V) = (z - Ho) - y ^^^^^ 

From the above it foUows that 

G{z) = Go(z) + Go(z)yGo(z) + Go(2)yGo(2)l/Go(2) + . . . (715) 
Or, in matrix representation 



G(2;|a;o) = Go(a;|2:o) + j GQ{x\x2)dx2{x2\V\xi)dxiGo{xi\xQ) + . . . (716) 
Note that for the scalar potential V = u{x) we get 

G(a;|a;o) = Go(a;|xo) + / dx' Ga{x\x')u{x')Go{x'\xQ) + ... (717) 



== [27.2] Perturbation Theory for the Propagator 

For the Green function we get 

G+(w) = G+{uj) + G+{uj)VG+{uj) + G+{uj)VG+{w)VG+{lo) + ... (718) 
Recall that 

G+{lu)^ FT ^ -iQ{T)U{T) (719) 
Then from the convolution theorem it follows that 



{~i)[Q{t)U{t)] = {-i)[QU^{t)] + {-lY J dt'[Q{t - t )U{t - t )] [V] [Q{t )U{t )] + ... (720) 
which leads to 

oo „ 

U{t)^Uo{t) + Y,i-^r / dtn...dt2dhUo{t~tn)V...Uo{t2-h)VUo{h) (721) 

n=i Jo<ti<t2<-<t„<t 
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for t > and zero otherwise. This can be illustrated diagrammatically using Feynman diagrams. 

Let us see how we use this expression in order to get the transition probability formula. The first order expression for 
the evolution operator is 

U{t) ^Uo-i J dt'Uoit ~~ t)VUo{t) (722) 

Assume that the system is prepared in an eigenstate m of the unperturbed Hamiltonian, we get the amplitude to find 
it after time t in another eigenstate n is 

{n\Uit)\m) = e-'^"*(5„™ - t J dt e-'^"'^'-'' \n\V\m)e~'^'-*' (723) 

If n 7^ m it follows that: 



Ptin\m) = \(n\U{t)\m)\^ = 







(724) 



== [27.3] Perturbation theory for the evolution 

In this section we review the elementary approach to solve the evolution problem via an iterative scheme with the 
Schrodingcr equation. Then we make the bridge to a more powerful procedure. Consider 

\^)=J2^,Mn) (725) 

n 

It follows that: 



dt 



= + ^\/„„.vi/„, (726) 



We want to solve in the method which is called "variation of parameters", so we set: 

*„(t) = c„(t)e-'^'^* (727) 
Hence: 

^ = -*E°'^''"^''"'^'^""'C"'W (728) 

n' 

From the zero order solution Cnit) ~ 5nrn we get after one iteration: 

Cn{t) = Snrn - i f dt'c^^^" V„,„ (729) 



Jo 

In order to make the connection with the formal approach of the previous and of the next section we write 

c„ = e^^"**„ = {n\Uo'Uit)\m) = {n\Ui(t)\m) (730) 
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and note that 

e'^''"-''-^'Vnm = {n\Uo{t)-^VUo{t)\m) = {n\Vi{t)\m) (731) 
Hence the above first order resuh can be written as 

{n\Ui{t)\m) = 5n,n - I I {n\Vi{t')\m) dt' (732) 

In the next sections we generahze this result to all orders. 

== [27.4] The Interaction Picture 

First wc would like to recall the definition of time ordered exponentiation 

U{t,ta) = (1 - idtNn{tN))---{l-idt2n{t2)){l-idt{H{ti)) = rc~*-/'4 (733) 

Previously we have assumed that the Hamiltonian is not time dependent. But in general this is not the case, so we 
have to keep the time order. The parenthesis in the above definition can be "opened" and then we can assemble the 
terms of order of dt. Then we get the expansion 

Uit^to) = \ - i{dt]sn{tN)) i{dt{H{ti)) + {~if{dtNn{tN)){dtN-in{tN-i)) + -.. (734) 

= i-i f ■H{t')dt' + {-if [ ■H{t")n{t')dt"dt' + ... 

Jto<t'<t Jto<t'<t"<t 



^(-i)" [ dtn...dtinitn)...niti) 

n=0 Jto<ti<t2-<t^<t 



Note that if 7i(i') ~ TL is not time dependent then we simply get the usual Taylor expansion of the exponential 
function where the 1/n! prcfactors would come from the time ordering limitation. 

The above expansion is not very useful because the sum is likely to be divergent. What we would like to consider is 
the case Ti. = Tio + V , where V is a small perturbation. The perturbation V can be either time dependent or time 
independent. We shall write the evolution operator as 

U{tM) = Uo{t,to)Ui{t,to)Uo{to,0) (735) 

So Ui ~ 1 ii V — 0, and otherwise it gives the effect of the interaction as if it happened at the reference time to. For 
simplicity we adopt from now on the convention to ~ 0, and use the notation U{t) instead of C/(t,to). By definition 
of H as the generator of the evolution we have: 

j^U{t) - -i(no + V)Uit) (736) 
We define 

Ui{t) = Uo{t)-'U{t) (737) 
Viit) - Uo{t)-^V Uo{t) 

With these notations the evolution equation takes the form 

j^Ui{t) = -iViit)Uiit) (738) 
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The solution is by time ordered exponentiation 

Uiit) = Tc^v(-i f Vi{t')dA = f^HT f dt,,...dhVi(t,,)...Vi{h) (739) 

V "'o / „_n Jo<ti< 



l<t2-<t„<t 



Which can be written formally as 



= J2hjr / dt,,...dhTVi{t„)...Vi{h) (740) 
n=o ' -^0 



Optionally we can switch back to the Schrodinger picture: 



oo ^ 

U{t) = Y.^-iT dt„...dt2dhUo{t-t„)V...Uo{t2-ti)VUo{ti) (741) 

n=o JQ<ti<t2<-<tn<t 

The latter expression is more general than the one which we had obtained via FT of the resolvent expansion, because 
here V is allowed to be time dependent. 



[27.5] The Kubo formula 



Consider the special case of having a time dependent perturbation V — f{t)A. The first order expression for the 
evolution operator in the interaction picture is 



Ui{t) = 1-i f Ai{t')f{t')dt' 
Jo 



(742) 



where Aj(t) is A in the interaction picture. If our interest is in the evolution of an expectation value of another 
observable B, then we have the identity 

{B)t = {i,{t)\Bm)) = {'^\Bn{t)\i,) = {4'\Ui{t)-^Bi{t)Ui{t)\^p) (743) 

To leading order we find 

{B)t = {Bi{t))+ f a{t,t')f{t')dt' (744) 



where the linear response kernel is given by the Kubo formula: 

a{t,t')^~i{[Bi{t),Ai{t')]) (745) 
In the above formulas expectation values without subscript are taken with the state if). 



[27.6] The S operator 



The formula that we have found for the evolution operator in the interaction picture can be re- written for an evolution 
that starts at t — — oo (instead of t = 0) and ends at t = oo. For such scenario we use the notation S instead of Ui 
and write: 

S = J2hjr dt^...dhTVi{t^)...Viih) (746) 
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It is convenient to regard t = —oo and t = oo as two well defined points in the far past and in the far future respectively. 
The limit t — > — oo and i ^ oo is ill defined unless S is sandwiched between states in the same energy shell, as in the 
context of time independent scattering. 

The S operator formulation is useful for the purpose of obtaining an expression for the temporal cross-correlation 
of (say) two observables A and B. The type of system that we have in mind is (say) a Fermi sea of electrons. The 
interest is in the the ground state ip of the system which we can regard as the "vacuum state" . The object that we 
want to calculate, written in the Heisenberg / Schrodinger / Interaction pictures is 

CBAit2,h) = {^\BH{t2)AHihm (747) 
= {^\U{t2)-^ BU{t2,h)AU(ti)\i;) (748) 
= {MUi{t2)-' Biit2)Uiit2,ti)Ai{ti)Ui{ti)\^) (749) 

where it is implicitly assumed that <2 > ii > 0, and we use the notation Ui{t2,ti) = Ui{t2)Ui{ti)~^ , which is the 
evolution operator in the interaction picture referenced to the time ti rather than to t^O. Note that S = ?7/(oo, — oo). 
Assuming that "0 is obtained from the non-interacting ground state via an adiabatic switching of the perturbation 
during the time — oo < t < we can rewrite this expression as follows: 

CBAit2,ti) = (^\Ui{(x),^^)-^Ui{^,t2)Bi{t2)Ui{t2,ti)Ai{ti)Ui{ti,-^)\^) (750) 
= {<l>\S^TSBj{t2)Ai{h)\<j>) (751) 

In the last line it is implicit that one should first substitute the expansion for S, and then (prior to integration) 
perform the time ordering of each term. The Gell-Mann-Low theorem rewrites the above expression as follows: 

{4,\TBH{t2)AH{hm = ^^'^'^^/i'^!!'^''^''^^ = (0|r^B,(t2)A,(ti)|0)„, (752) 

{(p\s\4>) 

The first equality follows from the observation that due to the assumed adiabaticity the operation of S" on is merely 
a multiplication by a phase factor. The second equality is explained using a diagrammatic language. Each term in the 
perturbative expansion is illustrated by a Feynman diagram. It is argued that the implicit unrestricted summation 
of the diagrams in the numerator equals to the restricted sum over all the connected diagrams, multiplied by the 
unrestricted summation of vacuum-to- vacuum diagrams (as in the numerator). The actual diagrammatic calculation 
is carried out by applying Wick's theorem. The details of the diagrammatic formalism are beyond the scope of our 
presentation. 



[28] Complex poles from perturbation theory 
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[28.1] Models of interest 



In this section we solve the particle decay problem using perturbation theory for the resolvent. We are going to show 
that for this Hamiltonian the analytical continuation of the resolvent has a pole in the lower part of the complex 
0-plane. The imaginary part ("decay rate") of the pole that we find is the same as we found by either the Fermi 
golden rule or by the exact solution of the Schrodinger equation. 

We imagine that we have an "unperturbed problem" with one energy level |0) of energy Eq and a continuum of levels 
|fc) with energies Ek- A model of this kind may be used for describing tunneling from a metastable state. Another 
application is the decay of an excited atomic state due to emission of photons. In the latter case the initial state would 
be the excited atomic state without photons, while the continuum are states such that the atom is in its ground state 
and there is a photon. Schematically we have 



n = Ha + V 
where we assume 



(753) 



(fc|F|0) = (Tk (coupling to the continuum) 
(fc'|V^|fc) = (no transitions within the contimmm) 



(754) 



Due to gauge freedom we can assume that the coupling coefficients are real numbers without loss of generality. The 
Hamiltonian matrix can be illustrated as follows: 



( Eo 




a'2 


0-3 


<yi 


El 













E2 
















V 



cTfe 










Ek 



(755) 



We later assume that the system under study is prepared in state |0) at an initial time (t 
the probability amplitude to stay in the initial state at a later times. 



0), and we want to obtain 



It is important to notice the following (a) we take one state and neglect other states in the well, (b) we assume that V 
allows transitions from this particular state in the well (zero state |0)) into the |fc) states in the continuum, while we do 
not have transitions in the continuum itself. These assumptions allow us to make exact calculation using perturbation 
theory to infinite order. The advantage of the perturbation theory formalism is that it allows the treatment of more 
complicated problems for which exact solutions cannot be found. 



[28.2] Heuristic approach 



The Hamiltonian of the previous section is of the form 









yp"^ 





(756) 



and the wavefunction can be written as 



(757) 
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In the decay problem of the previous section Hq is a 1 x 1 matrix, and is an oo x cxd matrix. The perturbation 
allows transitions between P and Q states. In the literature it is customary to define projectors V and Q on the 
respective sub-spaces. 

V + Q = I (758) 



Our interest is focused only in the piece of the wavefunction, which describes the state of the particle in the P space. 
On the other hand we assume that TL has a continuous spectrum, and therefore any E above the ground state defines 
a stationary state via the equation 



Ell) (759) 



From the second raw of the above equation it follows that x = G^V^^ij}, where = \/{E — H^). Consequently 
the reduced equation for -0 is obtained from the first raw of the matrix equation 

+ V^^G^V'^P] i; = Ei; (760) 

The second term in the square brackets S(£;) = V^'^G^V'^p is the so called "self energy" . Using the vague prescrip- 
tion E 1-^ E + iO we can split it into an effective potential due to so called "polarization cloud" (also known as Lamb 
shift) and an imaginary part due to FGR decay: 



= H^ + A-ir/2 (761) 

reP 

= 2n^VirVrjS{E - Er) (763) 



reP 



We argue that generates the time evolution of tp within P space. This effective Hamiltonian is non-hermitian 
because probability can escape to the Q space. It can be diagonalized so as to get the decay modes of the system. 
Each mode is characterized by its complex energy E^ — i(Tr/2). Note that the effective Hamiltonian is non-hermitian 
and therefore the eigen-modes are in general non-orthogonal [see discussion of generalized spectral decomposition in 
the Fundementals I section]. 

The advantage of the above treatment is its technical simplicity. However, the mathematical significance of the 
E 1-^ E + iO prescription is vague. We therefore turn to a more formal derivation of the same result, which illuminates 
how the non-hermiticity emerges once the Q continuum is eliminated. 



[28.3] The P + Q formalism 



We want to calculate the resolvent. But we are interested only in the single matrix element (0, 0) because we want to 
know the probability to stay in the |0) state: 



survival probability 



FT 



(0|G(c.)|0) 



(764) 



Here G{uj) is the retarded Green function. More generally we may have several states in the well. In such a case 
instead of P = 10) (01 we have 



p= 

nswell 



(765) 
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*|G(tj)|«') = FT {^\G^{uj)\^) 1 ^ (766) 



If we prepare the particle in an arbitrary state ^' inside the well, then the probability to survive in the same state is 
survival probability = FT 

Namely, we are interested only in one block of the resolvent which we call 

G^{z) = PG{z)r (767) 
Using the usual expansion we can write 

= PGP = P ,J , ^ P = P{G^ + GoVGa + ...)P (768) 

z- [Hq + V) 

= PGaP + PGo{P + Q)V{P + Q)GoP + PGo{P + Q)V{P + Q)Go{P + Q)V{P + Q)GqP + ... 

— nP I nPs^PnP i nPs^P nP\^P nP i 

1 

- z-{HP+^n 

where the "self energy" term 

^ yPQ^QyQP (7g9) 

represents the possibility of making a round trip out of the well. Note that only even order terms contribute to this 
perturbative expansion because we made the simplifying assumption that the perturbation does not have "diagonal 
blocks". In our problem E-^ is a 1 x 1 matrix that we can calculate as follows: 

k k 
k k 

where 

To = 2nJ2\Vk\^KE-Ek) = 27:giE)\V\'' (771) 

k 

which is in agreement with the Fermi golden rule. Thus, the resolvent is the 1x1 matrix. 

^''^^^ " z - {HP + S^) " z-{e^ + 5E^)+i{To/2) ^^^^^ 

We see that due to the truncation of the Q states we get a complex Hamiltonian, and hence the resolvent has a pole 
in the lower plane. The Fourier transform of this expression is the survival amplitude. After squaring it gives a simple 
exponential decay e~'"*. 
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Scattering Theory 



[29] The plane wave basis 



There arc several different conventions for the normalization of plane waves: 

• Box normalized plane waves \n) 

• Density normalized plane waves |fc) 

• Energy shell normalized plane waves \E, fl) 

We are going to be very strict in our notations, else errors are likely. We clarify the three conventions first in the ID 
case, and later in the 3D case, and then to remark on energy shell bases in general. 



[29.1] Plane waves in ID 



The most intuitive basis set originates from quantization in a box with periodic boundary conditions (a torus): 

\n) (773) 

where 

K = (774) 
Orthonormality: 

{n\m) = 5nra (775) 

Completeness: 

Y.\n){n\ = l (776) 

n 

The second convention is to have the density normalized to unity: 

|fc)_>e»'^-=^ (777) 

Orthonormality: 

(fc|fc') = 27r(5(fc-fc') (778) 
Completeness: 

\k)-{k\^i (779) 

ZTT 

Yet there is a third convention which assumes that the states are labeled by their energy, and by another index that 
indicate the direction. 

\E, n) = -^\kn) -^e'f^"^ (780) 
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where < E < oo and Q is the direction of propagation with uq = ±1, hence 



kn = nnks = ±V2mE (781) 
Orthonormahty: 

{E, n\E', n') = 2tt5{E ~ E')5nn' (782) 
Completeness: 

/dE 
Y.\E^n){EM = i (783) 

In order to prove the orthonormahty we note that dE = VEdk and therefore 

S{E - E') = —5{k - k') (784) 

VE 

The energy sheh normahzation of plane waves in ID is very convenient also for another reason. We see that the 
probability flux of the plane waves is normalized to unity. We note that this does not hold in more than ID. Still also 
in more than ID, the S matrix formalism reduces the scattering problem to ID channels, and therefore this property 
is very important in general. 

^^^= [29.2] Plane waves in 3D 



The generalization of the box normalization convention to the 3D case is immediate. The same applied to the density 
normalized plane waves: 

|fc) — ^ e*^^ (785) 
Orthonormahty: 

{k\k')^[2TTf5^{k-k') (786) 
Completeness: 



(27r)3 



1^)7:^(^1 = 1 (787) 



The generalization of the energy shell normalization convention is less trivial: 

\E,n) = ^^\kn) ^ ^-^e*-^ (788) 
27r ,,/vE 2tt ^Jve 

where we define the direction by = (0, (p), with an associated unit vector and a wavenumber 

hi = kEUa (789) 
Orthonormahty: 



(£:, vl\e' , ^) ^ 2ttS{e - E')6^{n - n') 



(790) 
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Completeness: 

^ / \E,n)dn{E,n\ = i (791) 

277 J 

To prove the identities above we note that 

dE dE 

d^k = k^dkdn = kl — dVL = kl — dipdcosO (792) 
ve ve 

and 

6^{k-k') = ^6{E ^ E')S^ {n-n') = ^S{E - E')S{ip - (p')S{cose - cosO') (793) 
k^ k^ 

In general we have to remember that any change of the "measure" is associated with a compensating change in the 
normaUzation of the delta functions. 

== [29.3] Optional energy shell bases 

Instead of the standard energy shell basis \E, fl) one can use some other basis \E, a), where a is a quantum number 
that labels the different basis states. In particular in 3D problem it is convenient to use the \E,£m) basis: 

{E',n\EJm) = 2tt5[E' - E)Y^"'{n) (794) 

Note that in position representation we have 

{rMEAm) = 2^ j,{kEr)Y'^{n) (795) 



This wavefunction is properly normalized as an energy-shell basis state. The asymptotic behaviour of the spherical 
Bessel function is ii{kr) ~ sin(fcr — phasc)/(fcr). and accordingly, as in the ID case, this wavefunction is also "flux 
normalized" : the flux of both the ingoing and the outgoing wave components is unity. 



[30] Scattering in the T-matrix formalism 
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[30.1] The Scattering States 



Our purpose is to solve the Schrodinger's equation for a given energy E. 

{Ho + V)^ = (796) 
If we rearrange the terms, wc get: 

iE-na-V)'9 = (797) 

In fact we want to find scattering solutions. These are determined uniquely if we define what is the "incident wave" 
and require outgoing boundary conditions for the scattered component. Thus we write ^' as a superposition of a free 
wave and a scattered wave, 

* = + (798) 
The scattered wave v[/«'=''** is required to satisfy outgoing boundary conditions. The free wave (j> is any solution of: 



7^00 = Eel) (799) 
Substituting, we obtain: 

{E-rio- l/)4''"="" = y0 (800) 
with the solution 

^scatt ^ Q+V(j) (801) 

leading to: 

^' = (1 + G+V)(j) (802) 



== [30.2] The Lippman Schwinger equation 

The explicit solution for that was found in the previous section is in typically useless, because it is difficult to get G. 
A more powerful approach is to write an integral equation for 'f. For this purpose we re-arrange the differential 
equation as 



{E - Ho)* = V^l- (803) 

Using exactly the same procedure as in the previous section we get 

^ = (/) + G+V^ (804) 

This Lippman Schwinger equation can be solved for ^' using standard techniques (see example in the next section). 
More generally closed analytical solutions cannot be obtained. Still if V is small we can try to find a solution iteratively, 
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starting with the free wave as a zero order solution. This leads to a perturbative expansion for which wc arc going 
to derive in a later section using a simpler approach. 

== [30.3] Example: scattering by a regularized delta function 



Let us demonstrate the procedure of solving Lippman Schwinger equation for scattering on a delta potential ud(x) in 
one dimension. The relation — (j) + CqV^^ is written as 

= e**^'^ - i— ^'(0)e*'''l^l (805) 

Setting a; = we obtain a closed equation for the unknown wave amplitude '5(0), whose solution leads to 

xl;(a;) = e^'=- + re^'^l-l , r = "'^^^^ (806) 

1 + i[u/ve) 

where r is identified as the reflection coefficient, while the transmission coefficient is t = 1 + r. It is instructive to 
consider the more general case of scattering by a regularized delta function in any dimension, and with any dispersion 
relation. This generalization is also known as s-scattering. In particular it illuminates how a divergent scries for the 
T matrix (see next section) can give a finite result, and why in the absence of regularization a delta function does not 
scatter in more than one dimension. By a regularized delta function we mean a potential V{x) that has the matrix 
elements 14, fc' = u for any k and k' within some finite momentum range. For example |fc| < A, where A is a large 
momentum cutoff. Another possibility which is of relevance in the analysis of Kondo problem is Ep < \k\ < A, where 
Ep is the Fermi energy. Using the Lippman Schwinger equation procedure we deduce that that the solution of the 
scattering problem is 

u 

*(x) = 0(x)+Go(x|O)w„„^(O), u^,, = — — (807) 

1 - uQ{E) 

where 

GiE) ^ G„(0|0) = E^T^TTo 

k 

Using the T matrix language of the next sections, one observes that T = u^u5{x) is simply a renormalized version 
of the potential V = uS{x). An optional method to derive this result is to generalize the calculation that we had 
presented for the Green function G = Go + GqEGq of a particle in quasi ID network. 

The k summation in the G{E) calculation should be treated with the appropriate integration measure. It equals —i/vp 
for a non-regularized Dirac delta function in one dimension. In higher dimensions G{E) has a real part that diverges, 
which implies that the scattering goes to zero. The regularization makes G{E) finite. In three dimensions we get 
g{E) = -(m/7r2)A£; where 

^---^'^ X (2^fc|3fc^-^-2^-l°4A3fcf)+^2^- 

Another interesting case to consider is having (say) a constant density of states with a threshold energy. The divergence 
of g{E) near a threshold is logarithmic. It is quite amusing that the second order as well as all the higher terms in 
perturbation theory are divergent, while their sum goes to zero... 

== [30.4] Perturbation Theory for the Scattering State 

Going back to the formal solution for 'I' we can substitute there the perturbative expansion of G 

G = Go + GoV^Go + GqVGoVGo + ... (810) 
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leading to 

= (1 + G+V)(j) = + GoV(t) + GoVGoVcj) +... (811) 

As an example consider the typical case of scattering by a potential V(x). In this case the above expansion to leading 
order in space representation is: 

*(a-) = 0(x)+ ( Go{x,x')V{x')(j){x')dx' (812) 



[30.5] The T matrix 



It is customary to define the T matrix as follows 

T = V + VGoV + VG^VGqV + ... = V + VGV (813) 

The T matrix can be regarded as a "corrected" version of the potential V ^ so as to make the following first order 
look-alike expression exact: 

G Go + GoTGo (814) 
Or the equivalent expression for the wavcfunction: 

^' = + Gj"r0 (815) 
Later it is convenient to take matrix elements in the unperturbed basis of free waves: 

= (ri'l^l'/'^) (816) 

T^p{E) = (nr(£;)|/) 

In principle we can take the matrix elements between any states. But in practice our interest is in states that have 
the same energy, namely Eq. = Ep = E. Therefore it is convenient to use two indexes (E,a), where the index a 
distinguishes different free waves that have the same energy. In particular a may stand for the "direction" (il) of the 
plane wave. Thus in practice we are interested only in matrix elements "on the energy shell": 

Tafc(i?) = (0^'°|T(i?)|0^-'') (817) 

One should be very careful to handle correctly the different measures that are associated with different type of indexes. 
In particular note that in 3D: 

1 fkE^'' 



[30.6] Scattering states in ID problems 



In this section we look for a scattering solution that originates from the free wave \ko). Using the result of a previous 
section we write ^ = (f)^" + GqTc/)'"' with 

(j)''«{r) = e''"'"' [density normalized] (819) 
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In Dirac notations: 

I*) = \(l)^'>) + G^T\(t>^") (820) 
In space representation: 

(xl*) = + (x|Go+r|0'=°) (821) 

or in "old style" notation: 

■^{x) = 0'=°(a;) + J G+{x\xQ)dxQ{xo\T\ko) (822) 
In ID problems the green function is 

G+{x\xo) = {x\G+\xo) = _^e'k^\---o\ (823) 

Ve 

Thus we get: 

,p(x) = e''^^«"-— /e'^^l"-"°ldxo(a;o|T|fco) (824) 

VE J 

By observation of the left and right asymptotic regions we deduce that the reflection coefficient is 

r = -— (-fco|T|fco) (825) 

VE 

while the transmission coefficient is 

t = I - — {ko\T\ko) (826) 

VE 

== [30.7] Scattering states in 3D problems 

The derivation is the 3D case follows the same procedure as in the ID case. The incident wave is 

0fco(^) = e''''°-^ [density normalized] (827) 
and the implied scattering state is 

<Sf{r) = cl>'">{r)+ J G+{r\r„)dro{ro\T\k„) (828) 

where 



G+{r\ro) = (r|G+|r„) = ^^—^ (829) 
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Thus we get 

^{r) = (r) 



m 

2^ 



j^ifcelr-rol 

\r - ro\ 



{ro\T\ko) dro 



(830) 



So far everything is exact. Now we want to get a simpler expression for the asymptotic form of the wavefunction. 
Note that from the experimental point of view only the "far field" region (far away from the target) is of interest. 
The major observation is that the dro integration is effectively bounded to the scattering region |r| < tq where the 
matrix elements of V and hence of T are non-zero. Therefore for \r\ ^ |ro| we can use the approximation 



\r- fo\ = Vir-roV = vW^2f^^bTO(h7F) « \r\ 

Here and below we use the following notations: 
r = \r\nn 

n = {9, If) ~ spherical coordinates 
= (sin0 cos (/), sin0 sin0, cos 0) 



1 - '^o 
r 



r\ - nn ■ ro 



(831) 



(832) 





With the approximation above we get 



^(r) 



Akn-f 



2tt r 



(ro|r|fco) dro 



where 



m 



f{n)^-—{h,\T{E)\ko) 

ZTT 



It follows that the differential cross section is 



da 

dn 







(—Y 

\2TThJ 


^ kn ,ko 



pikE\r\ 



(833) 



(834) 



(835) 



This formula assumes density normalized plane waves. It relates the scattering which is described by f{^) to the T 
matrix. It can be regarded as a special case of a more general relation between the S matrix and the T matrix, as 
discussed in later sections (and see in particular the discussion of the "optical theorem" below). 
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== [30.8] Born approximation and beyond 

For potential scattering the first order approximation T V leads to the Born approximation: 

f{n) = -^{k,,\TiE)\ko) « ~^V{q) (836) 

where q = kn — kg and V{cf) is the Fourier transform of V{r). The corresponding formula for the cross section is 
consistent with the Fermi golden rule. 

It is customary in high energy physics to take into account higher orders. The various terms in the expansion are 
illustrated using Feynman diagrams. However, there are circumstance where we can gain better insight by considering 
the analytical properties of the Green function. In particular we can ask what happens if G has a pole at some complex 
energy z = Er — i{rr/2). Assuming that the scattering is dominated by that resonance we get that the cross section 
has a Lorentzian line shape. More generally, If there is an interference between the resonance and the non-resonant 
terms then we get a Fano line shape. We shall discuss further resonances later on within the framework of the S 
matrix formalism. 

^= [30.9] The Optical Theorem 

From the energy-shell matrix elements of the T operator we can form an object that later we identify as the S matrix: 

Sab = Sab-^{E,a\T\E,b) (837) 

Using a more sloppy notation the equation above can be written as 5 = 1 — iT. We should remember that S is not 
an operator, and that Sab are not the matrix elements of an operator. In contrast to that Tab are the on-shell matrix 
elements of an operator. This relation assumes that the energy shell basis states are properly normalized. 

The S matrix, as we define it later, is unitary. So we have S^S = 1, and therefore we conclude that the T matrix 
should satisfy the following equality: 

(yt - T) = iT^T (838) 

In particular we can write: 

{ao\T -T^ao) = -iJ2{ao\T\a){a\T^ao) (839) 

a 

and we get: 

J2\Taa„f = -'^MTaoao] (840) 
a 

This so called "optical theorem" establishes a connection between the "cross section" and the forward scattering 
amplitude Tkoao- It should be clear that if this relation holds in one particular energy-shell basis, then it holds also 
in any other (properly normalized) energy shell basis. In 3D scattering problems we can write one of the following 
expressions: 

^|(^£/.m|7^|0BA,mo)|2 ^ _2Im[(0^A' |T| '™« )] (841) 

i,m 

^\{<j)^'^\T\<j)^'^<>)\^ = -2lm[{(t,^'^°\T\(t)^'^«)] 
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Using the relations 

\E,n) = ^^\kn) (842) 
Jve 27r 



f{n) = -^{kn\T{E)\k,) 
we get the familiar versions of this theorem: 

\{kn\T\k^)?dn = -2vEC^\ Im[(fco|T|fco)] (843) 



fc_E 
4-7r 

\imHn = —iu,[fm 

ks 
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[31] Scattering in the 5- matrix formalism 
== Channel Representation 



Before we define the S matrix, Let us review some of the underlying assumptions of the S matrix formahsm. Define 
pix) = l^ixf (844) 
The continuity equation is 

| = -V.J (845) 

We are working with a time-independent Hamiltonian and looking for stationary solutions, hence: 

V • J = (846) 

The standard basis for representation is \x). We assume that the wave function is separable outside of the scattering 
region. Accordingly wc arrange the basis as follows: 

\x £ inside) = the particle is located inside the scattering region (847) 
|a,r) = the particle is located along one of the outside channels 

and write the quantum state in this representation as 

I*) = E ¥^(2^) + E^'^W (848) 
KG inside 

The simplest example for a system that has (naturally) this type of structure is a set of ID wires connected together 
to some "dot". In such geometry the index a distinguishes the different wires. Another, less trivial example, is a lead 
connected to a " dot" . Assuming for simplicity 2D geometry, the wavefunction in the lead can be expanded as 

*(x) = ^{r,s) = E^"W^"(*) (849) 

a 

where the channel functions (waveguide modes) are: 

X°(s) = J^sinff^a) s) with a = 1, 2, 3 . . . and < s < £ (850) 



e \\£ 

In short, we can say that the wavefunction outside of the scattering region is represented by a set of radial functions: 
^{x) ^ Ra{r) where a = 1, 2, 3 . . . and < r < oo (851) 



The following figures illustrate several examples for scattering problems (from left to right): three connected wires, 
dot-waveguide system, scattering in spherical geometry, and inelastic scattering. The last two systems will be discussed 
below and in the exercises. 
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[31.2] The Definition of the S Matrix 



Our Hamiltonian is time-independent, so the energy E is a good quantum number, and therefore the Hamiltonian 
H is block diagonal if we take E as one index of the representation. For a given energy E the Hamiltonian has an 
infinite number of eigenstates which form the so called "energy shell". For example in 3D for a given energy E we 
have all the plane waves with momentum \k\ = Ue, and all the possible superpositions of these waves. Once we are 
on the energy shell, it is clear that the radial functions should be of the form 



Ra{r) = AaR^''-{r)-BaR 



-(r) 



(852) 



For example, in case of a waveguide 
1 



R 



E,a,± 



(r) = 



[flux normalized] 



(853) 



where the radial momentum in channel a corresponds to the assumed energy E^ 



(854) 



and the velocity iv = ka/m in channel a is determined by the dispersion relation, 
wavefunctions can be represented by a set of ingoing and outgoing amplitudes: 



Thus on the energy shell the 



{Aa,Ba) 



with a = 1, 2, 3 . 



(855) 



But we should remember that not all sets of amplitudes define a stationary energy state. In order to have a valid 
energy eigenstate we have to match the ingoing and outgoing amplitudes on the boundary of the scattering region. 
The matching condition is summarized by the S matrix. 



(856) 



By convention the basis radial functions are "flux normalized" 
\Ba\^ — l^aP and from the continuity equation it follows that 



Consequently the current in channel a is 



(857) 



From here follows that the S matrix is unitary. The unitarity of the S matrix sometimes implies surprising conse- 
quences. For example: in ID the transmission from-left-to-right must be equal to the transmission from-right-to-left. 
More generally: if we have a system with two different leads 1 and 2 then X^aei 662 \^ba\'^ — Sae2 6ei I'^'baP- The 
latter observation is essential for the theory of the two-terminal Landauer conductance. 
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In order to practice the definition of the S matrix consider a system with a single 2D lead. Let us assume that the 
lead has 3 open channels. That means that ka is a real number for a = 1, 2, 3, and becomes imaginary for a > 3. The 
a > 3 channels arc called "closed channels" or "evanescent modes". They should not be included in the S matrix 
because if we go far enough they contribute nothing to the wavefunction (their contribution decays exponentially). 
Thus we have a system with 3 open channels, and we can write 

-^'=1'^ - Bie+'^'i'') (858) 

and 

= S I ^2 I (859) 



Ri{r) = 


^(^le- 
^/vl 


R2{r) = 




^3(r-) = 


^(^36 



== [31.3] Example: scattering by a delta junction 

The simplest non-trivial example for an S matrix, is for a system which is composed of two ID wires labeled as i = 1, 2, 
attached by a junction that is modeled as a delta barrier u5{x). Using the standard " s-scattering" convention the 
channel wavefunctions arc written as 4'(r) cx Aiexp(— ifcr) — Biexjp{ikr) where r = \x\. The linear relation between 
the As and the Bs is determined by the matching conditions. The matching conditions for the channel wavefunction at 
the origin r = in the case of a delta barrier are discussed in the " QM in practice I" section, where it is also generalized 
to the case of having a junction with M wires. The S matrix which is implied by these matching conditions is 

S = l^] , [such that S=l for u=oo] (860) 

with 

VE = {2E/mf''^ (861) 

t = ,^}, , (862) 

1 + 1[U/VE) 

r = -1 + t (863) 

(864) 

In practice it is more convenient to use in the present context an ad-hoc convention of writing a raw-swapped S 
matrix, corresponding to the re-definition of the outgoing amplitudes as Bi = —B2 and B2 = —Bi, namely, 

s=(l ;) = c- (_,^,» ^0 [-^h -='^ 

with 

7 = arg(<) = — arctan^u / V e) mod (tt) (866) 

^ - TTW^ - ^'''^ 

a = (868) 
(j) = (869) 
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The advantage of the ad-hoc convention is the possibility to regard m = as the unperturbed Hamiltonian, and to 
use the T matrix formaUsm to find the S matrix. AU the elements of the T matrix equal to the same number T, and 
the equality S ^ 1 — iT implies that r = —iT and t = 1 — iT. 



== [31.4] Scattering states 

Let us define the unperturbed Hamiltonian TLq as that for which the particle cannot make transitions between channels. 
Furthermore without loss of generality the phases of R"-^(r) is chosen such that Sab = ^ab, or equivalently Ba = Aa, 
should give the "free wave" solutions. We label the "free" energy states that correspond to the Hamiltonian Ho as 
1^). In particular we define a complete set that are indexed by a = {E,a), Namely, we define 

10") = |0^^»-=) ^ Sa,ajR''-{r)-R''+{r)) (870) 

The following figure illustrates how the "free wave" \4>^''^) 1 of a three wire system looks like. 



1 




2 



3 



It is clear that the states {(f)^''^) form a complete basis. Now we take the actual Hamiltonian H that permits transitions 
between channels. The general solution is written as 

I*") ^ AaR''-{r)-BaR''+{r) (871) 

where Ba = SabAb or the equivalent relation Aa = {S~^)abBb- In particular we can define the following sets of 
solutions: 

= 1*^°'^°+) ^ <5a,a„i?"-(r)-5a,a„i?"+(r-) (872) 

= 1*^°"°-) ^ (^-^)a,a„i?"-(r)-<5a,a„i?'^+(r) 

The set of (+) states describes an incident wave in the channel {Aa = Sa^a^) and a scattered wave that satisfies 
"outgoing" boundary conditions. The sign convention is such that |0") is obtained for S' = 1. The set of (— ) states 
is similarly defined. We illustrate some of these states in the case of a three wire system. 




3 3 3 



== [31.5] Time reversal in scattering theory 

It is tempting to identify the (— ) scattering states as the time reversed version of the (+) scattering states. This is 
indeed correct in the absence of a magnetic field, when we have time reversal symmetry. Otherwise it is wrong. We 
shall clarify this point below. 
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Assuming that we have the solution 



The time reversed state is obtained via complex conjugation: 



(873) 



(874) 



This should be contrasted with 



ikx 



■ 8 e 



ikx 



(875) 



We see that the two coincide (disregarding a global minus) only if 

S* = s-^ 



(876) 



which means that the S matrix should be symmetric (S*^ = S). This is the condition for having time reversal 
symmetry in the language of scattering theory. 



== [31.6] Orthonormality of the scattering states 

The (+) states form a complete orthonormal basis. Also the (— ) states form a complete orthonormal basis. The 
orthonormality relation and the transformation that relates the two basis sets are 



(-Bi,ai,+|£;2,a2,+) 
{Ei,ai,-\E2,a2,-) 
{Ei,ai,-\E2,a2,+) 



= 2nS{Ei - E2)da,^a, 
= 27r6{Ei - E2)Sa,,a, 
= 27rS{Ei - E2)Sa,^a, 



(877) 
(878) 
(879) 



The last equality follows directly from the definition of the S matrix. Without loss of generality we prove this lemma 
for the 3 wire system. For example let us explain why it is true for ai = 1 and 02 = 2. By inspection of the figure in 
a previous section we see that the singular overlaps comes only from the first and the second channels. Disregarding 
the "flux" normalization factor the singular part of the overlap is 



{Ei,l,~\E2,2,+) 






/•OO 






- 5i2e+*«'' 


dr + 






singular 




Jo 










Jo 



— iknr 



dr 



(5-^)210"*'^ 

S'i2e-*('=-'=°)''(ir + / 5i2e-'('^-'^«)''dr = 27rJ(fc - fco)5'i2 



where is the second line we have changed the dummy integration variables of the second integral from r to — r, 
and used (5^^)21 = 5*12. If we restore the "flux normahzation" factor we get the desired result. One can wonder 
what about the non-singular contributions to the overlaps. These may give, and indeed give, an overlap that goes 
like 1/{E ± Eq). But on the other hand we know that for E ^ Eq the overlap should be exactly zero due to the 
orthogonality of states with different energy (the Hamiltonian is Hermitian). If we check whether all the non-singular 
overlaps cancel, we find that this is not the case. What is wrong? The answer is simple. In the above calculation we 
disregarded a non-singular overlap which is contributed by the scattering region. This must cancel the non-singular 
overlap in the outside region because as we said, the Hamiltonian is Hermitian. 



^^^^= [31.7] Getting the S matrix from the T matrix 

The derivation of the relation between the S matrix and the T matrix goes as follows: On the one hand we express 
the overlap of the ingoing and outgoing scattering states using the S matrix. On the other hand we express it using 



143 



the T matrix. Namely, 



^1) 



By comparing the two expressions we derive a relation between the 5" matrix and the T matrix. 



(882) 



or in abstract notation S = 1 — iT. Another way to re-phrase this relation is to say that the S matrix can be obtained 
from the matrix elements of an 5* operator: 



27rS{Ei - E2) Sa,a 



(883) 



where the S operator is the evolution operator in the interaction picture. Within the framework of the time dependent 
approach to scattering theory the latter relation is taken as the definition for the 5* matrix. The two identities that we 
prove in the previous and in this section establish that the time-independent definition of the S matrix and the time 
dependent approach are equivalent. The rest of this section is dedicated to the derivation of the T matrix relation. 

We recall that the scattering states are defined as the solution of H'i' = E'i> and they can be expressed as 



^E2,a2,+ ^ ^ G+(i;2)V^)(/>^''"' 

^^i'"!'- = (1 + G"(£:i)y)0'^i'"i 



1 



E2-H + iO 
1 

El -Ho- iO 



(884) 



TiEi 



where the last equality relays on GV = GqT. In the following calculation we use the first identity for the "ket", and 
after that the second identity for the " bra" : 



1 



E2-n + iQ 



1 



V 



-V 



E2~Ei+ iO 



J,-E2,02\ 



A-Ei,ai 



A-Ei,ai 



A-Ei,ai 



l + TiEi) 



1 



Ei-Hq+ iO 



T{Ei 



E1-E2+ iO 
T{Ei) 



kE2,a.2 



E1-E2+ iO 



V 



E2-E1+ iO 



lE2-CL2 \ 



kEi ,ai 



V + T{Ei)G+{Ei)V 



E2-E1+ iO 



kE2,a2\ 



i-B2,a2 



lEi ,ai 



27rS{Ei - E2)5a,a2 - i2^5{Ei - E2){(b'''-''' \T{Ei 



T{Ei) 



E2-E1+ iO 



iE2,a2\ 



iE2,a2\ 



(885) 



where before the last step we have used the relation V + TGqV = T. 



=== [31.8] Subtleties in the notion of cross section 

Assume that we have a scattering state We can write it as a sum of "ingoing" and "outgoing" waves or as a sum 
of "incident" and "scattered" waves. This is not the same thing! 



^ i7igoing ^~ ^outgoing 



(886) 
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The "incident wave" is a "free wave" that contains the "ingoing" wave with its associated "outgoing" component. 
It corresponds to the Hq Hamiltonian. The "scattered wave" is what we have to add in order to get a solution to 
the scattering problem with the Hamiltonian Ti.. In the case of the usual boundary conditions it contains only an 
"outgoing" component. 



^ ingoing ^ outgoing 





scattered 



11/ 

incident 





Below is an illustration of the ID case where we have just two directions of propagation (forwards and backwards): 
^ingoing - - incident 



outgoing 



I ' "^outgomg ^sca^ered |' 



The S matrix gives the amplitudes of the outgoing wave, while the T — i{l — S) matrix gives the amplitudes of the 
scattered component (up to a phase factor). Let us consider extreme case in order to clarify this terminology. If we 
have in the above ID geometry a very high barrier, then the outgoing wave on the right will have zero amplitude. 
This means that the scattered wave must have the same amplitude as the incident wave but with the opposite sign. 

In order to define the "cross section" we assume that the incident wave is a density normalized {p = 1) plane wave. 
This is like having a current density J ~ pvs- If we regard the target as having some "area" a then the scattered 
current is 

iscattered = {rVe) X (7 (887) 

It is clear from this definition that the units of the cross section are [a] = L'^~^. In particular in ID geometry the 
"differential cross section" into channel a, assuming an incident wave in channel oq is simply 

<Ja = \Ta.aof (888) 

and by the "optical theorem" the total cross section is 

= -2Im[Tao,ao] (889) 

The notion of " cross section" is problematic conceptually because it implies that it is possible for the scattered flux to 
be larger than the incident flux. This is most evident in the ID example that we have discussed above. We see that 
the scattered wave is twice the incident wave, whereas in fact the forward scattering cancels the outgoing component 
of the incident wave. Later we calculate the cross section of a sphere in 3D and get twice the classical cross section 
(27ra^). The explanation is the same - there must be forward scattering that is equal in magnitude (but opposite in 
sign) to the outgoing component of the incident wave in order to create a "shadow region" behind the sphere. 
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== [31.9] The Wigner time delay 

A given element of the S matrix can be written as 

Sab = Vae'' (890) 

where < 51 < 1 is interpreted as either transmission or reflection coefRcient, while 9 is called phase shift. The cross 
section is related to g while the Wigner time delay that we discuss below is related to theta. The Wigner time delay 
is defined as follows: 

T.^UE) - h— (891) 

Consider for example the time delay in the case of a scattering on a "hard" sphere of radius R. We shall sec that in 
such case we have a smooth energy dependence 6 w —2kR and consequently we get the expected result r « —2R/ve- 
On the other hand we shall consider the case of a scattering on a shielded well. If the energy is off resonance we get 
the same result r « —2R/ve- But if E is in the vicinity of a resonance, then we can get a very large (positive) time 
delay r ~ h/Tr, where is the so called "width" of the resonance. This (positive) time delay reflects "trapping" and 
it is associated with an abrupt variation of the cross section as discussed in a later section. 

Let us explain the reasoning that leads to the definition of the Wigner time delay. For this purpose we consider the 
propagation of a Gaussian wavepackct in a given channel: 

^{x) = [ dkc-'^''^''- y/^cxp[i{k{x~xo)+e-Et)] (892) 



where both g and 9 and E are functions of fc. In order to determine the position x of the wavepaket we use the 
stationary phase approximation. Disregarding the Gaussian envelope, and regarding g as constant at the energy 
range of interest, most of contribution to the dk integral comes from the k region where the phase is stationary. This 
leads to 

d9 dE , , 

X-XO + — - — 1 = (893) 

The position x of the wavepacket is determined by the requirement that the above equation should have a solution 
for fc ~ /cq- Thus we get 

x = xo + X (t~ Tdrf.y) (894) 

where Wg^oup = {dE /dk) is the group velocity for k ~ kg and t^^i^^ is the Wigner time delay as defined above. How 
this is related to a scattering problem? Consider for example scattering on one dimensions where we have " left" and 
"right" channels. Assume that the wavepacket at t = is an undistorted Gaussian with 9 — 0. The initial wavepacket 
is centered on the "left" around x — xq with momentum k = k^. We would like to know how x evolves with time 
before and after the scattering. Say that we observe the transmitted part of the wavepacket that emerges in the 
"right" channel. Following the above analysis we conclude that the effect of introducing 7^ is the Wigner time 
delay. 



^= [31.10] The Friedel phase and the DOS 

The issue that we discuss in this section is how to determine the number M{E) of energy levels in the scattering 
region if we know how the S matrix depends on the energy E. This would be the number of no interacting Fermions 
that can be accommodated there at zero temperature. More precisely what we can extract from S{E) is the density 
of states g{E) in the scattering region. To have a clear idea of the physical picture it is best to imagine a single lead 
system, where the lead is a waveguide with A4 modes, and the "depth" of the scattering region is L. In such system 



g{E) ~ Mxvey ~ Mx Td^ay 



(895) 



146 



where r^^i^y ^ L/v-^ is the semiclassical estimate for the time delay. We shall see that the so called Friedel-sum-rule 
is merely a refinement of the latter formula. In complete analogy with the Wigner formula, that express t^^i^^ as the 
E derivative of the scattering phase shift, we are going to express q{E) as the E derivative of the "Friedel phase". 

The most illuminating way to define the energy levels of the scattering region is to impose Dirichlet boundary condition 
at the r = section. This leads to the equation (say for 3 modes): 



(896) 

This equation has a non-trivial solution if and on if 

det (s(£;) - l) = (897) 



Form this condition we can extract the eigen-encrgies. Recall that the S matrix is unitary. Therefore its M complex 
eigenvalues e**'' are located on a circle. We get zero determinant in the above equation each time that one of the 
eigen-phases cross through 9 = 0. Thus the mean level spacing is simply I-k j M. divided by the average "velocity" 
dejdE. This leads to 



27r \ dE ) 2711 \dE 

where the Friedel phase is = X^r equality is easily derived by calculating the trace in the basis in which 

S is diagonal. 

A straightforward generalization of the above considerations allow to calculate the number of fcrmions that are emitted 
from the scattering region if a parameter X is being varied very very slowly such that at any moment wc have zero 
temperature occupation. The result can be written as dN = —GdX where 

G = -Ltrace ( %S^\ (899) 



2TTi \ dX 

The further generalization of this relation to multi-lead geometry has been discussed by Brouwer following the work 
of Buttikcr Pretre and Thomas, and is known as the scattering approach to quantum pumping. 

^== [31.11] Phase shifts and resonances 

If we have only one wire (hence one channel) the S matrix is 1 x 1, and accordingly we can write 

^00 = exp[i2(So(S)] (900) 
Too - -e'^"2sin(,5o) (901) 

where So{E) is known as the phase shift. If the scattering potential is short range one can use a matching procedure in 
order to determine the phase shift. This will be discussed in the next lecture. Another possibility is to use perturbation 
theory via the T matrix formalism: 

Too = Vbo + {VGV)qo = Born expansion (902) 

It is essential to calculate the matrix elements using properly flux-normalizcd free waves: 

^ e-^'^'^'" - -^e+''=^'' = -i^=2sm{kEr) (903) 



There arc two limiting cases of particular interest. 
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• First order non-rcsonant scattering by a weak potential V 

• Resonant scattering which is dominated by a single pole of G 

In the case of scattering by a weak potential we can use the first order Born approximation for the T matrix: 

Too ~ Voo = \V\(I)'') = — Vir){sm{kEr)fdr (904) 



VE Jo 

The assumption of weak scattering implies ^ Ij leading to the first order Born approximation for the phase shift: 
/ V(r)(sm(kEr)fdr (905) 

VE Jo 

This formula is similar to the WKB phase shift formula. It has a straightforward generalization to any i which we 
discuss in the context of spherical geometry. We note that we have manged above to avoid the standard lengthy 
derivation of this formula, which is based on the Wronskian theorem (Messiah p. 404). 

The other interesting case is resonant scattering where 

Too « iVGV)oo = ^_£;^ + ,(rj2) 

From the optical theorem 2Im[Too] = — |TooP we can deduce that the numerator, if it is real, must equal F^. Thus we 
can write this approximation in one of the following equivalent ways: 

-e-e;tw^) ^ ^ 

tan(5,) = -^^^ (909) 

Note that if the optical theorem were not satisfied by the approximation we would not be able to get a meaningful 
expression for the phase shift. In order to prove the equivalence of the above expressions note that Sq can be regarded 
as the polar phase of the complex number z = (E—Er) — i{Tr/2). 

The Wigner time delay is easily obtained by taking the derivative of do{E) with respect to the energy. This gives a 
Lorentzian variation of t^^i^^ as a function of (E—Er). The width of the Lorentzian is F^, and the time delay at the 
center of the resonance is of order h/Tr- 
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[32] Scattering in quasi ID geometry 

In this lecture we consider various scattering problems is which the radial functions are exp(±ikar), where a is the 
channel index. In order to find a scattering solution we have to use a matching procedure. Namely the ingoing and 
outgoing free wave solutions should match an interior solution within the scattering region. This requirement imposes 
a boundary condition that relates the wavefunction amplitude and its derivative on the boundary. The simplest 
example for this class of systems is a network that consists of M semi ID wires that are connected at one junction. 
The problem of s-scattering {£ = 0) in spherical geometry is formally the same as M=l "semi ID wire" problem. We 
shall discuss the general case (any £) in a later lecture. The ID scattering problem on a line, where we have "left" 
and "right" leads, is formally an Ai=2 "semi ID wire" system. Then it is natural to consider less trivial Ai channel 
systems that can be regarded as generalizations of the ID problem, including scattering in waveguides and multi-lead 
geometries. Also inelastic scattering in ID can be formally re- interpreted as an A4 channel "semi ID wire" problem. 



[32.1] The matching procedure 



In this section we would like to outline a procedure for finding an exact result for the phase shift is quasi ID one 
channel geometry. This procedure will allow us to analyze s-scattering by "hard" spheres as well as by "deep" wells. 
The key assumption is that we have a finite range potential which is contained within the region r < R. The boundary 
of the scattering region at r ~ i? is fully characterized by the logarithmic derivative ko{E). The definition of the latter 
is as follows: given the energy E, one finds the regular solution ipif) of the Schrodingcr equation in the interior (r < R) 
region; Typically this solution is required to satisfy Dirichlct boundary conditions at the origin ?■ = 0; Once the regular 
solution is known, the logarithmic derivative at the boundary is calculated as follows: 



ko 



1 dip{r) 
ip{r) dr 



(910) 

r=R 



The derivative should be evaluated at the outer side of the boundary. For each problem fcg {E) should be evaluated 
from scratch. But once it is known we can find both the E < {) bound states and the E > Q scattering states of the 
system. 

Finding the bound states: In the case of a bound state the wavefunction in the outside region is -0(r) oc exp(— |fc£;|r). 
The matching with the interior solution gives the equation fco(£') — ~\kE\- This equation determines the cigen-energies 
of the system. In order to have bound states fcp should become negative. A necessary condition for that is to have an 
attractive potential {V <0). 

Of particular interest is the long-wavelength limit, which means ksR ^ 1 where R is the radius of the scattering region. 
Then the regular solution for r ^ R^ where the potential is zero, has the form ipir) cx x — a^. The parameter is 
called the scattering length. The logarithmic derivative at r = Ris ko = 1/{R — Qs). If we have > R then it becomes 
negative and we find a single bound state whose energy is 

i^bo„„d = , ^ (911) 

2m(a^ — Ry 



Finding the scattering states: For positive energies we look for scattering states. The scattering solution in the 
outside region is 

■^{r) = Ac-'^'^'' - Be'^"'' = ^(g-^'^^'- - e'^aogifeEr-j ^ Csin(fc£;r + <5o) (912) 
where 5o is the phase shift. The matching with the interior solutions gives the equation 

kEC0i{kER + So) = ko (913) 
This equation can be written as 

tan((5o-(5(r) = ^ (914) 
ko 
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where 5^ = —ksR is the solution for "hard sphere scattering". Thus the exphcit solution is 

So = 5^5° +arctan(^) (915) 
fco 



== [32.2] Low energy scattering 

It is clear that in practical problems the boundary r = i? is an arbitrary radius in the "outside" region. It is most 
natural to extrapolate the outside solution into the the region r < i? as if the potential there is zero. Then it is 
possible to define the logarithmic derivative kg of the extrapolated wavefunction at r = 0. This function contains the 
same information as fcg, while the subtlety of fixing an arbitrary R is being avoided. 

At low energies the logarithmic derivative at the origin is fco = {{^/ko) — R)^^ ■ For a sphere of radius R we have 
fco = oo and therefore fco ~ —l/R. More generally it is customary to expand fc'o as follows: 

ko{E) = kECot{6o{E)) ^ -- + ir,fc| + ... (916) 

a, 2 

The parameter is known as the scattering length. For a sphere of radius R we have ~ R while more generally we 
have = R — (1/fco). From the previous section we recall that a bound state exists only if > i?, which is feasible 
only for an attractive potential. 

We see that at low energies the phase shift can be expressed using the scattering length as Sq = —kEa^- In a later 
section we shall see that in 3D s-scattering this means that the total cross section is 

cTtotai = ■r2-sin2(5o) « (917) 

A delta function V{x) = u5^{x) in 3D has a zero total cross section. Still from the first order Born approximation 
for the T matrix we get as an artifact 

(Ttotai = 47r[— (918) 

It is therefore common to describe an s-scatterer which is characterize by a scattering length ^ i?, as a negative 
delta scatterer with u = —2TTa^/m. One should be very careful with the latter point of view. 

== [32.3] Inelastic scattering by a delta scatterer 



We consider the following scattering problem in one dimensions [http://arxiv.org/abs/0801.1202]: 

n = + g<5(x) + Kc.tte.„ (919) 

zm 

The scatterer is assumed to have energy levels n with eigenvalues It should be clear that inelastic scattering of 
a spinless particle by a multi level atom is mathematically equivalent to inelastic scattering of a multi level atom by 
some static potential. Outside of the scattering region the total energy of the system (particle plus scatterer) is 

£ = ek+En (920) 

We look for scattering states that satisfy the equation 



(921) 
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The scattering channels are labeled as 

n={no,n) (922) 
where uq — left, right. We define 



kn = ^2m{£ - En) for n e open (923) 
an = ^/-2m{£ - En) for n G closed (924) 

later we use the notations 

Vn = fcn/m (925) 
Un = an/m (926) 

and define diagonal matrices v = diag{w„} and u ~ diag{u,i}. The channel radial ftmctions are written as 

R{r) = Ane-'''"'' + B„e+'*="'^ for n e open (927) 
R(r) = C„e-""'' for n e closed (928) 

where r = Next we derive expression for the 2N x 2 A'' transfer matrix T and for the 2N x 2 A'' scattering matrix 
S, where N is the number of open modes. The wavefunction can be written as 

*(r,no,g)=5]i?„„,„(r)x"(g) (929) 

n 

The matching equations are 

^-(0, right, Q) = ^-(0, left, Q) (930) 
[*'(0, right, Q) + *'(0, left, Q)] = 0*(0, Q) (931) 

zm 

The operator Q is represented by the matrix Qnm that has the block structure 



Qnm = ( ^™ ) (932) 



iQuv Q 

For sake of later use we define 

M„™=fi^™? (933) 

The matching conditions lead to the following set of matrix equations 

Ar + Br = Al + Bl (934) 

Cr = Cl (935) 

-iv{AR-BR + AL-BL) = 2Q,,{Al + Bl) + 2Q,uCl (936) 

-uiCR + CL) = 2Quv(.Al+Bl) + 2QuuCl (937) 
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from here we get 

Ar + Br = Al + Bl (938) 

Ar-Br + Al-Bl - i2{v)-^Q{Al + Bl) (939) 

where 

Q = Qvv- Qvu , \ , Quv (940) 

The set of matching conditions can be expressed using a transfer matrix formaUsm that we discuss in the next 
subsection, where 

M ^ -^Q^ = Mvv ~ Mvu , , \^ Mu. (941) 



== [32.4] Finding the S matrix from the transfer matrix 

The set of matching conditions in the delta scattering problem can be written as 

Br\ rr^fAL 



where v4„ = yJv^An and i?„ — y/v^Bn- The transfer 2N x 2N matrix can be written in block form as follows: 
T++ r+_\ _ fl~iM ~iM 



^=1/: T::) = [ ^M l + ^M^ (^^3) 
The S matrix is defined via 

Bl\ a(AL 



and can be written in block form as 
A straightforward elimination gives 



TriT_+ TZI \ _ /(l + iX)"^-l (l + iTW)"^ 



Now we can write expressions for Sr and for St using the M. matrix, 
1 



(946) 



. ^=l-iM- + iM' + ... (947) 
1 + iM 

Sr ^ St- 1 (948) 
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== [32.5] Elastic scattering by a delta in a waveguide 

The elastic scattering of a spinless particle by a regularized delta scatterer in a waveguide is mathematically the same 
problem as that of the previous section. We have 

Q = c5{v - yo) (949) 

for which 



Qnm = c J XnSiy - yo)Xmdy = c x"(t/o)x'"(yo) (950) 
Given the total energy we define 

/ 1\.T \ 

(951) 



~ \Muv Muu 

Regularization means that one impose a cutoff on the total number of coupled channels, hence M is a finite (truncated) 
matrix. Using the formula for inverting a matrix of the type 1 — aa^ , we first obtain A4 and then obtain 

(952) 



1 + i trace[7\/u„] + trace[Af„„] 



Let us consider what happens as we change the total energy: Each time that a new channels is opened the scattering 
cross section becomes zero. Similarly, if wc remove the regularization we get zero scattering for any energy because 
of the divergent contribution of the closed channels. 



^^^^= [32.6] Analysis of the cavity plus leads system 

Consider a cavity to which a lead is attached. The Fisher-Lee relation established a connection between the S = l — iT 
matrix and the Green function G. It can be regarded as a special variant oi T — V + VGV. The channels index is a 
and the cavity states are n. There is no direct coupling between the channels, but only a lead-cavity coupling W. On 
the energy shell we need the matrix elements Wan = {E, a\W\n). Consequently we expect an expression of the form 

S = 1 iT = 1 - iWGW^ (953) 

From perturbation theory we further expect to get 

^ " E-H,^ + i{W^W/2) ^^^^^ 

where Ti;,, is the truncated Hamiltonian of the interior region. The latter is known as the Widenmiller formula, and 
can be regarded as the outcome of R matrix theory which wc detail below. 

The standard derivation of the Fisher-Lee relation goes as follows [Datta] : We place a source at the lead and use the 
S matrix to define the boundary conditions on the surface x{s) of the scattering region. We solve for the outgoing 
amplitudes and find that 

G{s\so) = ^ ^ -^X^is) {S - l)ab -^x'i-so) (955) 



This relation can be inverted: 



Sab = Sab - WvaVb I x"' {s) G(s|so) X (so) dsdsQ (956) 
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The definition of W is implied by the above expression. In the next section we find it useful to define a complete set 
(p*-"-* (x) of cavity wavefunctions. Then the integral over s becomes a sum over n, and we get 

Wan - [ x''{s)v^"\x{s)) ds (957) 



The R matrix formalism opens the way for a powerful numerical procedure for finding the S matrix of a cavity-lead 
system. The idea is to reduce the scattering problem to a bound state problem by chopping the leads. It can be 
regarded as a generalization of the one-dimensional phase shift method where the outer solution is matched to an 
interior solution. The latter is characterized by its log derivative on the boundary. In the same spirit the R matrix 
is defined through the relation 

*„)./H,„,.')3*(.')<i.' (958) 

If we decompose this relation into channels we can rewrite it as 

'fa = Y^ Rabd'i'b (959) 

b 

Expressing and as the sum and the difference of the ingoing and the outgoing amplitudes Aa and Ba, one 
finds a simple relation between the R matrix and the S matrix: 

"^"^ y/kah ( 1 + S') ^^^^^ 

The inverse relation is 

^ ^ 1 + iVkRVk 
1 - iVkRy/k 

From the Green theorem it follows that 

H(s,s') = -|^G^(s'|s) (962) 

where is the Green function of the interior with Neumann boundary conditions on the surface of the scattering 
region. If wc find a complete set of interior eigenfunctions then 

and consequently 



ka J E — En \ \/kb 

The corresponding result for the S matrix is obtained by expanding (1 -I- x) /{I — x) — 1 + 2(...) with the identification 
of (...) as the diagrammatic expression of the resolvent. The result is known as the Weidenmiller formula: 



which agree with the Fisher Lee relation. 
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[33] Scattering in a spherical geometry 



Of special interest is the scattering problem in 3D geometry. We shall consider in this lecture the definitions of the 
channels and of the S matrix for this geometry. Later analyze in detail the scattering by a spherically symmetric 
target. In the latter case the potential V{r) depends only on the distance from the origin. In order to find the 
scattering states we can perform separation of variables, where £ and m are good quantum numbers. In the {£, m) 
subspace the equation for the radial function u{r) = rR{r) reduces to a one dimensional Schrodinger equation on the 
< r < oo axis. To avoid divergence in R{r) we have to use the boundary condition u{0) = 0, as if there is an infinite 
wall at r = 0. 

Most textbooks focus on the Coulomb interaction V{r,6,(p) = —a/r for which the effective radial potential is 
V^ii{r) = — a/r + /3/r'^. This is an extremely exceptional potential because of the following: 

• There is no centrifugal barrier. 

• Therefore there are no resonances with the continuum. 

• It has an infinite rather than a finite number of bound states. 

• The frequencies of the radial and the angular motions are degenerate. 

• Hence there is no precession of the Kepler ellipse. 

We are going to consider as an example scattering on a spherical target which we call cither "sphere" or "well". 
The parameters that characterize the sphere are its radius a, the height of the potential fioor V, and optionally the 
"thickness" the shielding U. Namely, 



Disregarding the shielding the effective radial potential is V^if{r) ~ VQ{a — r) + We consider first hard sphere 

(Vb = oo) and later on the scattering on a spherical well (Vb < 0). The effective potential for 3 representative values 
of V is illustrated in panels (a)-(b)-(c) of the following figure. In panels (d) we illustrate, for sake of comparison, the 
effective potential in case of a Coulomb interaction. 



VQ{a-r) + US{r-a) 



(966) 




a 



a 



2ma 




Classically it is clear that if the impact parameter 6 of a particle is larger than the radius a of the scattering region 
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then there is no scattering at all. The impact parameter is the distance of the particle trajectory (extrapolated as 
a straight line) from form the scattering center. Hence its angular momentum is £ = mvEb. Thus the semiclassical 
condition for non-negligible scattering can be written as 

b<a <^ £<kEa <^ ^ < E (967) 

2ma^ 

The last version is interpreted as the condition for reflection from the centrifugal barrier (see Fig). In the channels 
where the semiclassical condition is not satisfied we expect negligible scattering, (no phase shift). This semiclassical 
expectation assumes that the "forbidden" region is unaccessible to the particle, hence it ignores the possibility of 
tunneling and resonances that we shall discuss later on. Whenever we neglect the latter possibilities, the scattering 
state is simply a free wave which is described by the spherical Bessel function ji{kEr). For fc^jr <^ £ this spherical 
Bessel function is exponentially small due to the centrifugal barrier, and therefore it is hardly affected by the presence 
of the sphere. 



[33.1] The spherical Bessel functions 



Irrespective of whether the scattering potential is spherically symmetric or not the Schrodinger equation is separable 
outside of the scattering region. This means that we can expand any wavefunction that satisfies = E"^ in the 
outside region as follows: 

vI/(a;) = ^i?,,™(r)r^™(0,V5) (968) 

The channel index is a = {£, m) while Q = {9, cp) is analogous to the s of the 2D lead system. The are the channel 
functions. In complete analogy with the case of ID geometry we can define the following set of functions: 

h+{kEr)^e''"''^ (969) 
V(fcBO^e-'''=^'- 
ji{kEr) <-> sin(fcsr) 
ni{kEr) <-> cos{kEr) 

Note that the right side equals the left side in the special case i = 0, provided we divide by r. This is because the 
semi-ID radial equation becomes literally the ID Schrodinger equation only after the substitution i?(7') = u{r)/r. 

In what follows we use Messiah convention p. 489. Note that other textbooks may use different sign convention. The 
relation between the functions above is defined as follows: 

/i± ^ ni{kr)±ije{kr) (970) 

We note that the jn(^) are regular at the origin, while the ri„(r) are singular at the origin. Therefore only the former 
qualify as global "free waves". The I = functions are: 

, sin(fcr) 

Mkr) = — ^ (971) 
kr 

cos(fcr) 

no{kr) = — ; 

kr 
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The asymptotic behavior for kr ^ £+1 is: 

ni{kr) 
ji{kr) 



kr 

cos(fcr - f £) 
kr 

sa\{kr - f ^) 



(972) 



kr 



The short range kr <C ^'+1 behavior is: 



ni{kr) K {2l-l)\\ 
{krY 



ji{kr) 



(2^ + 1)!! 



e+i 



2{2l + 3) 



2(2/- 1 
(fcr)2 + . 



■{kr) 



(973) 



== [33.2] Free spherical waves 

On the energy shell we write the radial wavefunctions as 

Rem{r) = A,™i?^^^"--(r) - i?„„i?^-^'"^+(r) (974) 
where in complete analogy with the ID case we define 

^B/™,±(^) = hfikEr) (975) 



The asymptotic behavior of the spherical Hankel functions is (=pi)^e**'"'^''/ [kEr). From this follows that the flux of the 
above radial functions is indeed normalized to unity as required. Also the sign convention that we use for /j^'.^™'±(7.) 
is appropriate because the free waves are indeed given by 

I^BA") ^ [i?'™-(r)-i?'™+(r)]y^™(0,(^) = -i-^2jf(fcBr)r^™(0,(^) (976) 



This spherical free wave solution is analogous to the planar free wave 

|0iJ,J2N| ^ ^^tkEnn-3_ If decide (without loss of 
generality) that the planar wave is propagating in the z direction, then we can use the following expansion in order 
to express a planar wave as a superposition of spherical waves: 



^(2^+1) {iYPi{cos{e))j,{kEr) (977) 



We note that we have only m = basis functions because there is no dependence on the angle ip. In different phrasing 
one may say that a plane wave that propagates in the z direction has — Q angular momentum. Using the identity 



^ =\/^^^Kcos(0)) (978) 



we can write 



gifcE. = y ^(2/ + l)7r (z)^+i^ 0^.^-'"(r, 0, ^) (979) 



kE 
i,m=0 
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which makes it easy to identify the ingoing and the outgoing components: 

(e^'^)i„,oi„, - A^rnY'^^i^^^ v)R'"'^{r) (980) 



tm 



where 



-B,™ = Al„, = Srn.O VW+Tj^ {iY+^ (981) 

Kb 

This means that the incident flux in channel {i,0) is simply 

VE (982) 



^incident 



^(2^+1) 



The expression in the square brackets has units of area, and has the meaning of cross section. The actual cross section 
(see next section) contains an additional factor that express how much of the incident wave is being scattered. The 
maximum fraction that can be scattered is 400% as explained in the previous section. 

== [33.3] The scattered wave, phase shifts, cross section 



In the past we were looking for a solution which consists of incident plane wave plus scattered component. Namely, 

^(r) =c'''"' + f(Q)^—- (983) 
r 

From the decomposition of the incident plane wave it is implied that the requested solution is 

= E ^^™^'™(^' ^)R''^-{r) (984) 
*o...o.„. = - E BirnY'"'{e, ¥>)i?^™+(r) 



where 



A,™ = (5™,o V(2Z + 1)^ («)'+' (985) 

Bern — Siml'm' A/iim' 

If we are interested in the scattered wave then we have to subtract from the outgoing wave the incident component. 
This means that in the expression above we should replace Sim,e'm' by —iTim,e'm' ■ 

Of major interest is the case where the target has spherical symmetry. In such case the S matrix is diagonal: 

Slm,l'm'{E) = buibmra'^^^'^ (986) 

Tini,i'7n'{E) = -(5«'5„„„' e'** 2sin((5^) 
Consequently we get 

= - Tu ^{U + l)7r (z)^ Y'\6, ip)hjir) ^ (987) 
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with 



(988) 



It follows that 



= / Umi'dn = ^^(2£+l)|T«|2 



-^^(2£+l)|sin(<S, 



(989) 



By inspection of the scattered wave expression we sec that the scattered flux in each of the (£, m=0) channels can be 
written as if = ctpve, where the partial cross section is 



(2^ + i)rr 



(2£+l)|sin(5,)p 



477 



(990) 



It is important to realize that the scattered flux can be as large as 4 times the corresponding incident flux. The 
maximum is attained if the scattering induces a 7r/2 phase shift which inverts the sign of the incident wave. In such 
case the scattered wave amplitude should be twice the incident wave with an opposite sign. 



== [33.4] Finding the phase shift from T matrix theory 

We can get expressions for T« = Vee + (VGV)ii using the Born expansion, and hence to get approximations for the 
phase shift Si and for the partial cross section a£. The derivation is the same as in the quasi ID £—0 case. The 
flux-normalized free waves are 

10^'") = -i^2MkEr)Y'^{e, ip) (991) 



The first order result for the phase shift is 
2 f°° 

JB=,n « _ / V{r) {kErji{kEr)f dr (992) 
riVE Jo 

while in the vicinity of resonances we have 

6e = d^ - arctan [ J"''^^ ) (993) 

\E — Er J 

where is a slowly varying "background" phase that represent the non-resonant contribution. 

There are two physical quantities which are of special interest and can be deduced from the phase shift. One is the 
time delay that we already have discussed in the quasi one dimensional context. The other is the cross section: 

A-TT 

ae{E) = {2£ + l)-^\sm{S,)\^ (994) 

For resonant scattering the " line shape" of the cross section versus energy is typically Breit-Wigner and more generally 
of Fano type. The former is obtained if we neglect , leading to 



(995) 
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We see that due to a resonance the partial cross section can attain its maximum value (the so called "unitary 
limit"). If we take 5f into account we get a more general result which is known as Fano line shape: 

a, = (2£+ l)i|[sin(5r)]'^^^ (996) 

where e = {E — Er)/{T/2) is the scaled energy, and q ~ — cot((5^) is the so called Fano asymmetry parameter. 
The Breit-Wigner peak is obtained in the limit q ^ oo, while a Breit-Wigner dip (also known as anti- resonance) is 
obtained for q = 0. 

===== [33.5] Scattering by a soft sphere 

Assuming that we have a soft sphere (|T^| is small), wc would like to evaluate the phase shift using the Born approx- 
imation. Evidently the result makes sense only if the phase shift comes out small {6e <C 1). There arc two limiting 
cases. If ka <^ £+1 we can use the short range approximation of the spherical Bessel function to obtain 

^ "'^a\kaf'+' (997) 



{2e + 3) [{21 + ly.if 

On the other hand, if ka ^ £+1 we can use the "far field" asymptotic approximation of the spherical Bessel function 
which implies [krji{kr)\^ « [sin(A:r)]^ « 1/2. Then we obtain 

,5^° « ~^Va = -'^a'[ka)-^ (998) 
nvE n 



[33.6] Finding the phase shift by matching 



We would like to generalize the quasi one-dimensional matching scheme to any i. First we have to find the radial 
function for < r < a and define the logarithmic derivative 

- f^^l (999) 



R{r) dr 

Note that we use here R{r) and not u{r) and therefore in the t ~Q case we get /cq = ~ (l/^)- The solution in the 
outside region is 

R{r) = Ahjiksr) - Bh+{kEr) (1000) 
= A{h-{kEr)-e'^^'h+{kEr)) 
= C{cos{Si)je{kEr) + sm{6e)ne{kEr)) 

We do not care about the normalization because the matching equation involves only logarithmic derivatives: 

cos((5)j' -I- sin((5)n' 



cos{d)j + sm{S)Ti 
solving this equation for tan((5f) we get 



(1001) 



tan(J,) = ™M(^4:i^4M (1002) 
kini[ka) — kEn^yka) 
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which can also be written as: 

\h+ ) h-{h'+/h+)kE 

In order to go from the first expression to the second note that if tan((5) = a/h determines the phase of a complex 
number z ^ a + ib then (a + ib)/{a — it) = e'^"^. Thus we have expressions for the phase shift given the log derivative 
of the regular solution on the boundary of the scattering region. 

^^^^= [33.7] Scattering by a hard sphere 



The phase shifts for a hard sphere (V — > oo) can be found from the phase shift formula of the previous section using 
ki oo, leading to 

tan(r) = -^^^ (1004) 
or equivalently 

(1005) 



i25~ ^ h [kEa) 



h+(kEa) 

From the first version it is convenient to derive the result 
1 



(2£+l)!!(2^-l)!! 



{kEaf'^+'^ for ^ > fca (1006) 



where we have used the short range expansions ji cx and oc l/r^~^^. From the second version it is convenient to 
derive the result 

= -argih+ (kEa)) Ri ~{kEa~^i) for ^ < fca (1007) 
where we have used the asymptotic expression h{r) ^ (— *)^ e^^'^^' /r. 

Small hard sphere: In the case of a small sphere {kEa ^ 1) we have 1 ^ (5o ^ (^i ^ '^2 ■ ■ • and the £ = cross 
section is dominant 

5q = -{kEo) (1008) 
Hence 

'^total = p E + 1) (-5^) « T2 sin' (^o) « (1009) 
;=o 

We got a (T that is 4 times bigger than the classical one. The scattering is isotropic because only the ^ = component 
contributes to the scattered wave. 

Large hard sphere: Now we turn to the case of a large sphere (fc^a ^ 1). We neglect all 5i for which I > kEa. For 
the non- vanishing phase shifts we get 



Se= -{kEa) for £ = 0,2, 4,. 
df,^ _(fc^a) + | for £ = 1,3, 5,. 



(1010) 
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hence 



47r 



total 



p 1 (2Z + l)sin2(fcBa) + ^ (2/ + 1) cos2(fc£;a) 

47r '^^^ ,1 27r , 27r,, ^ 2 

^ (2^ + 1)2 "^F/ 2:rdx=^(fc^,af = 2^2 



(1011) 



f=0,l...fca 



This time the resuh is 2 times the classical result. The factor of 2 is due to forward scattering which partially cancels 
the incident wave so as to create a shadow region behind the sphere. 



== [33.8] Summary of results for scattering on a sphere 

We have discussed scattering on a " hard sphere" , on a " deep well" , and finally on a " soft sphere" . Now we would like 
to put everything together and to draw an (a, V) diagram of all the different regimes. 




Let us discuss the various regimes in this diagram. It is natural to begin with a small sphere. A small sphere means 
that the radius is small compared with the De-Broglie wavelength {ka < 1). This means that disregarding resonances 
the cross section is dominated by s-scattering {£ = 0). On the one hand we have the Born approximation and on the 
other hand the hard sphere result: 



CHard 
^0 



mV 



(ka) 



(1012) 



-(fca) 



Thus we see that the crossover from "soft" to "hard" happens at ^ ~ h^/{ma^). What about ^ < 0? It is clear 
that the Born approximation cannot be valid once we encounter a resonance (at a resonance the phase shift becomes 
large). So we are safe as long as the well is not deep enough to contain a quasi bound state. This leads to the sufficient 
condition — ?i^/(ma^) < < 0. For more negative V values we have resonances on top of a "hard sphere" behavior. 
Thus we conclude that for ka < I soft sphere means 



\V\ < nyima^) 



(1013) 
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We now consider the case of a large sphere {ka ^ 1). In the absence of resonances we have for small impact parameter 
{£ <^ ka) either the Born or the hard sphere approximations: 



CBon 

Of 



V 

hvE 

0{l) 



(1014) 



Also here we have to distinguish between two regimes. Namely, for fca ^ 1 soft sphere means 



\V\ < hvE/a (1015) 

If this condition breaks down we expect to have a crossover to the Hard sphere result. However, one should be aware 
that if V < E, then one cannot trust the hard sphere approximation for the low £ scattering. 
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QM in Practice (part I) 



[34] Overview of prototype model systems 



Below we consider some simple one-particle closed systems in 0D/1D/2D/3D. Later we further illuminate and analyze 
some prototype problems. 

Quasi OD systems: These arc N site systems. The simplest is the two site (TV = 2) system. If wc want to have 
non-trivial topology we must consider at least a 3-site {N = 3) system. Next we can construct A^-site chains, or rings 
or networks or so called tight-binding arrays. If we regard these models as a coarse grained description of motion in 
free space, then the hopping amplitude per unit time between two nearby sites i and j is determined by the mass of 
the particle (m) and by the hopping distance (a): 



Quasi ID systems: A ID segment can be regarded as the continuum limit of quasi OD chain. If the particle in 
confined between two ends, the model is know as "infinite well" or as ID box with hard walls. If there are periodic 
boundary conditions, the model is know as "ID ring". Several "ID boxes" can be connected into a network: each ID 
segment is called "bond", and the junctions are called "vortices". 

Soft walls: If the confining potential is not stepwise, but changes smoothly, then we call it "well with soft walls" . The 
simplest example is the harmonic oscillator where the potential is V{x) = . The prototype example for anharmonic 
potential with a dividing separatrix in phase space is the pendulum where Vix) — cos(a;). 

2D systems: Genuine 2D systems are chaotic, and typically called "billiards". These serve as prototype models for 
"quantum chaos" studies. But if the system has (say) rectangular or cylindrical symmetry then the problem reduces 
back to ID. 

3D systems: Genuine 3D systems are rarely considered. Typically people consider models with spherical symmetry 
(e.g. the Hydrogen atom) where the problem is reduced to ID, while the perturbations that do not respect the high 
symmetry are treated within a quasi OD framework (a few-level approximation). 

Interacting particles: The treatment of non- interacting system with Bosons or with Fermions reduces to one- 
particle analysis. In order to have new physics wc have to take into account interactions. The prototype models 
(Kondo / Hubburd) assume that the interaction is of the type t/ oc (n— l)n where n = a|ai is the simplest non-trivial 
possibility in the case of Bosons, while n = a\ai + a|a2 is the simplest non-trivial possibility in the case of Fermions. 

System- bath models: If we have few distinguished degrees of freedom that we call "system", and all the other 
degrees of freedoms form a weakly coupled background that we call "environment", then there is a sense in using 
a system-bath model in order to describe the (reduced) dynamics of the system. Typically the environment is 
characterized by its fluctuations, and then modeled as a large collection of harmonic oscillators. 



The simplest non-trivial network that we can imagine is of course a two site system. Without loss of generality we 
can write the Hamiltonian as 



2 



(1016) 



2ma2 



[35] Quasi OD networks 




(1017) 



where il = (2c, 0, e). We can define the occupation operator of (say) the first site as 



A^i ^ |1)(1| ^ 2(1 + a.) (1018) 
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The definition of the current that comes out from this site is impHed by the "rate of change formula", namely 



(1019) 



So we have the continuity equation 



dt 



(M) = -(Xi^2), 



(M) = IVi 



(1020) 



The generalization from a two site system to an N site system is straightforward. Assuming that the one-particle 
Hamiltonian in the standard (position) basis is represented by a matrix iJy , we define occupation operators and 
current operators as follow: 



T 



-~t\\j)H,M'\i)H^M 



(1021) 
(1022) 



so as to have the continuity equation 



(1023) 



Of particular important is the the standard tight binding model where we have a chain of N sites with near neighbor 
hopping. For simplicity we assume periodic boundary conditions, which is like saying that we have a closed ring. We 
also use length units such that the sites are positioned at a; = 1, 2, 3.... The Hamiltonian can be written as 



-c{D + D-^) + V{x) = -2ccofi{p) + V{x) 



(1024) 



where D is the one site displacement operator. The definition of the velocity operator is implied by the "rate of 
change formula" , namely 



i[H,x\ = ic{D-D-^) = 2csin(p) 



(1025) 



Note that for small velocities we have linear dispersion relation v ~ (l/m)p, where the mass is m = l/(2c). If distance 
between the sites were a rather then unity one would conclude the identification c = l/(2ma^). 
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[36] Bloch oscillation and and Landau-Zener dynamics 
== [36.1] Bloch dynamics of a two level system 



The most general Hamiltonian for a particle with spin ^ is: 

n = n-s^ Q^s^ + QySy + n,s^ (i026) 

Where S = \(7. The evolution operator is: 

U{t) = e~'*^ = e-'(^*)-^ = R{${t)) (1027) 

where = fit. This means that the spin makes precession. It is best to represent the state of the spin using the 
polarization vector M. Then we can describe the precession using a classical-like, so called "Bloch sphere picture". 
The formal derivation of this claim is based on the relation between M{t) and p{t). We can write it either as 
M{t) = trace{ap{t)) or as an inverse relation p{t) — (1 + A/(t) •cr)/2. In the former case the derivation goes as 
follows: 

Miit.) = trace(crj/9(t)) = trace(cri(i)p) (1028) 
= trace((i?"Vii?)p) = trace{{Rfjaj)p) = i?f^($(i)) ^^(0) 

where we have used the evolution law p{t) = U{t)p{Q)U{t)~^ and the fact that is a vector operator. 

We notice that the evolution of any system whose states form a dim=2 Hilbert space can always be described using 
a precession picture. The purpose of the subsequent exposition is (i) to show the power of the precession picture as 
opposed to diagonalization; (ii) To explain the notion of small versus large perturbations. With appropriate gauge of 
the standard basis the Hamiltonian can be written as: 

n=(^^{^ J^^^ = ^<7.+ca, = d-S (1029) 

Where ~ (2c, 0, e). In the case of a symmetric system (e = 0) we can find the eigenstates and then find the evolution 
by expanding the initial state at that basis. The frequency of the oscillations equals to the energy splitting of the 
eigen-energies. But once (e ^ 0) this scheme becomes very lengthy and intimidating. It turns out that it is much 
much easier to use the analogy with spin 1/2. Then it is clear, just by looking at the Hamiltonian that the oscillation 
frequency is 



n = v/(2c)2 + £2 (1030) 

and hence the eigenenergies are E± = ±57/2. Furthermore, it is clear that the precession axis is tilted relative to the 
z axis with an angle 

6*0 = arctan(2c/e) (1031) 

Accordingly the eigenstates can be obtained from the t and from the | states by rotating them an angle 9o around 
the y axis: 



cos(0o/2)\ f-sMeo/2) 
sin(0o/2) r ' ^ V cos(0o/2) 



(1032) 
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Let us assuming that initially the system is in the "f state, and let us define P{t) as the probability to be found after 
time t in the same state. So initially we have P{0) =- 1. We can easily find an explicit expression for P(t) in a later 
time without having to do any diagonalization of the Hamiltonian. Rather we exploit the well know result for a 
spin 1/2 rotation around a tilted axis: 

Pit) = |(T |e-^^*-^^| T)P = l-sin2(0o)sin2 (^^^ (1033) 

This result is called Rabi Formula. We see that in order to have nearly complete transitions to the other site after 
half a period, we need a very strong couphng (c » e) . In the opposite limit (c ^ e) the particle tends to stay in the 
same site, indicating that the eigenstates are barely affected. 

We note that the Rabi Formula can be optionally derived by considering the precession of the polarization vector M{t). 
This procedure is on the one hand more illuminating, but on the other requires better geometrical insight: One should 
realize that the inclination angle of M{t) oscillates between the values 9 = and 9 = 29o- Therefore Mz{t) oscillates 
between the maximal value = 1 and the minimal value = cos{29q). Using the relation P{t) = (1 + Mz{t))/2 
one concludes that P{t) oscillates with the frequency Q. between the maximal value 1 and the minimal value (cos(6'o))^ 
leading to the Rabi Formula. 



[36.2] Landau-Zener dynamics 



A prototype adiabatic process is the Landau-Zener crossing of two levels. The Hamiltonian is 

n = 2U -"0 ^ 2^^-3 + 2-1 (1034) 

Let us assume that the system is prepared at t = —00 in the lower ("-") level (which is the "up" state). We want to 
find what is the probability to find the system at t = 00 in the upper ("+") level (which is again the "up" state!). 
The exact result is know as the Landau-Zener formula: 



Poo{+\-) = Pood IT) = CXp 



2 a 



(1035) 



For a "fast" (non-adiabatic) process the problem can be treated using conventional (fixed basis) perturbation theory. 
This gives the equation 



dci (t) . K 

dt 2 ^ 



1 2 
-^-at 



c^(t) (1036) 



Setting C|(t) t-^ 1 and performing the time integration, one obtains the first order estimate for the probability l~p 
to make a transition form the "up" to the "down" state. This leads to p = 1 — {tt/2)[k^ /a] in accordance with the 
Landau-Zener formula. Below we assume that the process is adiabatic, and explain how the general result is obtained. 

In order to analyze the Landau-Zener transition we write the Schrodinger equation in the adiabatic basis. The 
instantaneous eigenenergies are 

En = (1037) 
and the associated eigenstates are 
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where 



n{t) 
e{t) 



arctan [n/ {at)) . 



(1039) 
(1040) 



Note that d=TT at t= — oo evolves to 9=0 at t=oo. In our choice of gauge the adiabatic eigenstates have real amplitudes, 
and therefore the Berry "vector potential" comes out zero. At the same time the non-zero element of the perturbation 
matrix W„m is 



1 



-0"3 



+ - 



. asin(6l) 



. an 



1 a 

2 k 



1 



(at/«;)2 + 1 



(1041) 



Following the standard procedure as in time-dependent perturbation theory we substitute 



a±(i) 



c±{t) exp 



E±dt' 



(1042) 



Using the rescalcd variable r = at/K, and changing to r = sinh(z), the expression for the dynamical phase is 

•2 1/ 1 



$(i) = _ / drVr^ + l 
a 



— X - z H — sinh(2z' 
a 2 \ 2 ^ ' 



(1043) 



Thus for the amplitude c_|_(i) we get the equation 



dc+(t) 
dt 



1 a 

2 k 



1 



(at/K)2 -t- 1 



c-{t) 



(1044) 



Starting with the zero order solution (c_(<) i-^ — 1), and performing the time integration, one obtains the first order 
adiabatic estimate for the amplitude cj^{t), leading to 



r2 + 1 



cosh(z 



(1045) 



In order to evaluate this integral once can use contour integration. The explicit expression for the phase $ as a 
function oi z = x + iy is 



^{z) 



^ 1 

a ^ 2 



a: 4- - sinh(2a;) cos(2j/) j "'^ * ^ 2 '^'^^^(^^) ^^'^(22/) 



(1046) 



The original contour of integration is ?/ = 0, but we would like to deform it into the complex plane so as to get rid 
of the rapid oscillations of the phase factor, and have instead a smooth monotonic variation. The deformed contour 
is displayed in the figure. The phase is pure imaginary along the curves C_ and C+. At zq = Q + i{'K /2) we have 
$ = I A)^^ I a, while coshz w i{z — zq). Consequently we have a pole leading to the same exponent as in the 
Landau-Zener result. The prefactor come out (7r/3)2. The derivation of the correct prefactor, which is unity, requires 
either exact solution of the differential equation using parabolic cylinder functions, or otherwise iterations of the above 
procedure to infinite order. 



[36.3] Adiabatic transfer from level to level 



A practical problem which is encountered in Chemical Physical applications is how to manipulate coherently the 
preparation of a system. Let us as assume that an atom is prepared in level \Ea) and we want to have it eventually at 
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level \E]y). We have in our disposal a laser source. This laser induce AC driving that can couple the two levels. The 
frequency of the laser is uj and the induced coupling is fl. The detuning is defined a,s S = uj — {Ei, — Ea). Once the laser 
is turned "on" the system starts to execute Bloch oscillation. The frequency of these oscillation is i7 = (51^ + (5^)^/^. 
This is formally like the coherent oscillations of a particle in a double well system. Accordingly, in order to simplify 
the following discussion we are going to use the terminology of a site-system. Using this terminology we say that with 
a laser we control both the energy difference 5 and the coupling f2 between the two sites. 

By having exact resonance [5 ~ 0) we can create "complete" Bloch oscillations with frequency 51. This is formally 
like the coherent oscillations of a particle in a symmetric double well system. In order to have 100% transfer from 
state \Ea) to state \E}j) wc have to keep i5 = for a duration of exactly half period {t = tt/VI). In practice this is 
impossible to achieve. So now we have a motivation to find a practical method to induce the desired transfer. 

There are two popular methods that allow a robust 100% transfer from state \Ea) to state \Ei,). Both are based on an 
adiabatic scheme. The simplest method is to change 5 gradually from being negative to being positive. This is called 
"chirp". Formally this process is like making the two levels "cross" each other. This means that a chirp induced 
transfer is just a variation of the Landau-Zener transition that we have discussed in the previous section. 

There is another so called "counter intuitive scheme" that allows a robust 100% transfer from state \Ea) to state ji?;,), 
which does not involve a chirp. Rather it involves a gradual turn-on and then turn-off of two laser sources. The first 
laser source should couple the (empty) state \Eb) to a third level \E(.). The second laser source should couple the (full) 
state \Ea) to the same third level \Ec)- The second laser is tuned on while the first laser it turned off. It is argued in 
the next paragraph that this scheme achieves the desired transfer. Thus within the framework of this scheme it looks 
as if a transfer sequence a i— > c i— s- 6 is realized using a counter intuitive sequence c i— > 6 followed by a i— > c. 

The explanation of the "counter intuitive scheme" is in fact very simple. All we have to do is to draw the adiabatic 
energy levels E-{t) and E^it) and E+{t) as a function of time, and then to figure out what is the "identity" of (say) 
the middle level at each stage. Initially only the first laser in "on" and therefore \Eb) and \Ec) split into "even" 
and "odd" superpositions. This means that initially i?o(0 corresponds to the full state \Ea)- During the very slow 
switching process an adiabatic evolution takes place. This means that the system remains in the middle level. At the 
end of the process only the second laser is "on" and therefore, using a similar argumentation, we conclude that E^it) 
corresponds to the state \Eb). The conclusion is that a robust 100% transfer from state \Ea) to state \Eb) has been 
achieved. 

[37] A few site system with Fermions 
== [37.1] A two-site system with one particle 



The problem of "positioning a particle of spin i in a specific location" is formally identical to the problem of "putting 
a particle in a two-site system" . In both cases the system is described by a two-dimensional Hilbert space dim = 2. 
Instead of discussing an electron that can be either " up" or " down" , we shall discuss a particle that can be either in 
site 1 or in site 2. In other words, we identify the states as: |1) = | t) and |2) = | [). 



Il> I2> 

The standard basis is the position basis |a; = 1), \x ~2). The k states are defined in the usual way. We have the even 
and the odd states with respect to reflection: 

|fc = 0) = |+) = ^(|1) + |2)) (1047) 
\k^^) = \-) = ^(Il>-|2» 



Note the formal analogy with spin 1/2 system: 
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1+) = I represents spin i polarized right. 
| — ) = I ^) represents spin ^ polarized left. 

The representation of the operator x is: 



(1048) 



where 



1 

-1 



is the third Pauli matrix. The translation operator is actually a reflection operator: 



R = D 



1 

1 



(1049) 



where cti is the first Pauli matrix. The k states are the eigenstates of the operator D, so they are the eigenstates 

of (7l. 



[37.2] A two site system with two different particles 



In this case the Hilbert space is four dimensional: dim ~ (2 x 2) = 4. If the two particles are different (for example, 
a proton and a neutron) then each state in the Hilbert space is "physical". The standard basis is: 

|1, 1) = |1) (g) |1) - particle A in site 1, particle B in site 1 

|1, 2) = |1) ® |2) - particle A in site 1, particle B in site 2 

|2, 1) = |2) ® |1) - particle A in site 2. particle B in site 1 

|2, 2) = |2) (g) |2) - particle A in site 2, particle B in site 2 

The transposition operator T swaps the location of the particles: 



T\i,j) = 



(1050) 



We must not make confusion between the transposition operator and the reflection operators: 



R^ 
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(1051) 



Instead of the basis |1,1), |1,2), |2,1), |2, 2), we may use the basis \A), |1,1), \S), |2, 2), where we have defined: 



= — (|1,2)-|2,1)) 
l^)-^(|l,2) + |2,l)) 



(1052) 



The state \A) is anti-symmetric under transposition, and all the others are symmetric under transposition. 



^^^^= [37.3] Placing together two identical particles 

The motivation for discussing this system stems from the question: is it possible to place two electrons in the same 
location so that one of the spins is up and the other is down, or maybe they can be oriented differently For example. 
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one spin left and one right, or one right and the other up. We shall continue using the terminology of the previous 
section. We may deduce, from general symmetry considerations that the quantum state of identical particles must 
be an eigenstate of the transposition operator (otherwise we could conclude that the particles are not identical). It 
turns out that we must distinguish between two types of identical particles. According to the "spin and statistics 
theorem" , particles with half-odd-integer spins (fermions) must be in an antisymmetric state. Particles with integer 
spins (bosons) must be in a symmetric state. 

Assume that we have two spin zero particles. Such particles arc Bosons. There is no problem to place two (or more) 
Bosons at the same site. If we want to place two such particles in two sites, then the collection of possible states is 
of dimension 3 (the symmetric states) , as discussed in the previous section. 

Electrons have spin 1/2, and therefore they are Fermions. Note that the problem of placing "two electron in one site" 
is formally analogous to the hypothetical system of placing "two spinless electrons in two sites". Thus the physical 
problem is formally related to the discussion in the previous section, and we can use the same notations. From the 
requirement of having an antisymmetric state it follows that if we want to place two electrons at the same location 
then there is only one possible state which is \A). This state is called the "singlet state". We discuss this statement 
further below. 

Let us try to be "wise guys". Maybe there is another way to squeeze to electrons into one site? Rather than placing 

one electron with spin "up" and the other with spin "down" let us try a superposition of the type | — ><— ) — | < >). 

This state is also antisymmetric under transposition, therefore it is as "good" as \A). Let us see what it looks like in 
the standard basis. Using the notations of the previous section: 

^ (I + -) - I - +)) = ^ (1+) ® I-) - |-> « 1+)) (1053) 
= ^((|1) + |2)) ® (|1) - |2))) - ^((|1) - |2)) ® (|1) + |2))) 
= --^(|1)®|2)-|2)®|1))==-|^) 

So, we sec that mathematically it is in fact the same state. In other words: the antisymmetric state is a single state 
and it does not matter if we put one electron "up" and the other "down", or one electron "right" and the other "left". 
Still let us try another possibility. Let us try to put one electron "up" and the other "right". Writing | "f^) in the 
standard basis using the notation of the previous section we get 

|1> ® 1+) = ® (|1> + |2» = ^(|1, 1) + |1,2» (1054) 

This state is not an eigenstate of the transposition operator, it is neither symmetric nor anti-symmetric state. Therefore 
it is not physical. 



[38] A few site system with Bosons 



[38.1] A two site system with N Bosons 



We discuss in what follows the Bose-Hubbard Hamiltonian for N Bose particles in a two site system. The dimension 
of the Hilbert space is JV = N + 1. Later we assume N ^ 1 so M ^ N. For simplicity we might further assume TV to 
be even, and define j = N/2, so as to have Af = 2j + 1. The Hamiltonian is 



n 



E 

1=1,2 



K 



{d\ai + a[a2) 



(1055) 



where we denote by K the hopping, by U the interaction, and by e = e2 — ei the bias. Since ni -I- n2 = iV is constant 



171 



of motion, the Hamiltonian for a given A'^ is formally equivalent to the Hamiltonian of a spin j particle. Defining 



J. 



^{ni -n2) 



J+ ~ a\a2 



(1056) 
(1057) 



we can re- write the Hamiltonian as 



Uj,~ eJz — KJx + const 



(1058) 



In the absence of interaction this is like having a spin in magnetic field with precession frequency f2 = (A", 0, e). In 
order to analyze the dynamics for finite U it is more convenient to re- write this Hamiltonian with canonically conjugate 
variables. Formally we are dealing with two coupled oscillators. So for each we can define action-angle variables in 
the following way (no approximations involved): 
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(1059) 



One should be aware that the translation-like operator e"'' is in fact non-unitary because it annihilates the ground 
state. We shall denote its adjoint operator as e"*''', so we have a\ = v^c^*'^'. Now it is possible to re- write the 
Hamiltonian using the conjugate variables n = ni and if = ifi — &s follows: 



n 



K 

y 



yJ{N -n){n+\) c'f + h.^ 



(1060) 



So far no approximation are involved. But now we want to allow an approximation which is valid for an iV 3> 1 
system. Consequently we ignore the small corrections that are associated with the proper ordering of the operators 
and write 



n 



Un^ - eh - Ky/{N - h)h cos(y) 



(1061) 



In the n ~ N/2 region of phase space this resembles the so-called Josephson Hamiltonian, which is essentially the 
Hamiltonian of a pendulum 



W,7o=cph=o„ = Ecih-nof - Ejcos{(p) 



(1062) 



with Ec = U and Ej = KN /2, while no is linearly related to e. The Josephson Hamiltonian is an over-simplification 
because it does not capture correctly the global topology of phase space. In order to shed better light on the actual 
Hamiltonian it is convenient to define in addition to the operator also a 6 operator such that Jz = {N/2) cos{6) 
while Jx « {N/2) sm{9) cos((p). It is important to realize that (p and 6 do not commute. In the {tp, n) language the 
defining relation is 



{N/2) l + cos(0) 



(1063) 



Using the {tp, 9) coordinates, the Hamiltonian takes the form 



n 



NK 



-u(cos 6)^ — e cos 6 — sin cos ip 



(1064) 
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where the scaled parameters are 

u^^, s^^ (1065) 

We can describe phase space using the canonical coordinates {(p,n), so the total area of phase space is 2ttN and 
Planck cell is 27Th with h~l. Optionally we can use spherical coordinates {ip,d), so the total area of phase space 
is 47r and Planck cell Air/N . Note that an area element in phase space can be written as dipdcosO. 

Within the framework of the semiclassical picture a quantum state is described as a distribution in phase space, and 
the eigenstates are associated with strips that are stretched along contour lines H{(p, 9) — E oi the Hamiltonian. The 
\n) states are the eigenstates of the A' = Hamiltonian. In particular the |n=A^) state (all the particles are in the 
first site) is a Gaussian-like wavepacket which is concentrated in the NorthPole. By rotation we get from it a family 
of states \9, tp) that are called coherent states. If U is large, e = 0, and K is very small, then the eigenstates are the 
symmetric and the antisymmetric superpositions of the | ± n) states. In particular we have "cat states" of the type: 

ICatState) = \ni=N,n2=0) + e'P^^^^'^|ni=0, n2=iV) (1066) 

If a coherent state evolves, then the non-linearity (for non zero U) will stretch it. Sometimes the evolved state 
might resemble a cat state. We can characterize the one-body coherence of a state by defining the one-body reduced 
probability matrix as 

= j^{a]a,) = i(l + (5).^) (1067) 

where Si = {2/N)Ji and the so called polarization vector is (S) = {{S^), {Sy), (Sz))- Note that 

AverageOccupation = {N/2) [1 + (5^)] (1068) 
OneBodyPurity = (1/2) [l + {S^f + {Syf + (S,)^] (1069) 

The coherent states have maximum OneBodyPurity. This purity can be diminished due to non-linear effects or due to 
interaction with environmental degrees of freedom. First example: If we prepare the particles in the ground state of 
the U = Hamiltonian of a symmetric double well, then we have a coherent state "polarized" in the x direction. If we 
turn on U, then this state will be stretched and eventually it will spread over the whole equator, thus leading to loss 
of the OneBodyPurity. Second example: The interaction with the environment leads to the loss of OneBodyPurity 
due to the loss of the ManyBodyPurity. If we have an environmental coUisional effect that " measures" in which well 
are the particles {Jz interaction), then it leads to dephasing towards the z axis, while if the coUisional effect is like 
measuring whether the particles are in the barrier region ( Jj, interaction) then the dephasing is towards the x axis. 



== [38.2] An M site system with A^^ Bosons 

A double well system with Bosons has formally the same Hilbert space as that of spin N/2 particle. The generalization 
of this statement is straightforward. The following model systems have formally the same Hilbert space: 



• The "N particle" states of a Bosonic system with M sites. 

• The "TV quanta" states of M Harmonic oscillators. 

• The states of a dim(iV) "spin" of the SU{M) group. 



We shall explain and formulate mathematically the above statements, and later we shall use the formal analogy with 
"spin" in order to shed light on the dynamics of N Bosons in M = 2 site system. 

To regard Bosonic site as an Harmonic "mode" is just a matter of language: We can regard the "creation" operator aj 
as a "raising" operator, and we can regard the "occupation" of the ith site hi = aja^ as the number of quanta stored 
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in zth mode. The one particle states of an M site system form an M dimensional Hilbert space. The set of unitary 
transformations within this space is the SU{M) group. Let us call these transformation generalized "rotations". If we 
have N particles in M sites then the dimensionality of Hilbert space is dim(A'') = (A^+1)!/[(M— 1)!A^— M+2)!]. For 
example for M = 2 system we have dim(A^) = 7V+1 basis states |ni, 712) with ni + rt2 = N. We can "rotate" the whole 
system using dim(Af) matrices. Thus we obtain a dim(Af) representation of the SU{M) group. By definition these 
"rotations" can be expressed as a linear combination of the SU{M) generators J^, and they all commute with N. 
We shall see that the many body Hamiltonian Tl may contain "non linear" terms such as that correspond to 
interactions between the particles. Accordingly Ti. of an interacting system is not merely a "rotation". Still we assume 
that Ti. commutes with A^. so we can work within dim(A^) subspace. 

There is a common mean field approximation which is used in order to analyze the dynamics of many Bosons system. 
In the semiclassical framework the dynamics in phase space is generated by the Hamilton equations of motion for the 
action- variables Ui and cpi. These arc the "polar coordinates" that describe each of the oscillators. Optionally one 
can define a "macroscopic wavcfunction" 

^pi = -y/nie*"^ representing a single point in phase space (1070) 

The equation for is known as the discrete version of the non-linear Schrodinger (DNLS) equation: 

,dipi I ^ ^^|_, 2A.T, ^ 



= (e, + [/|V'.|'J*,--(^,+i+V'.-i) (1071) 
In the continuum limit this equations is known as the Gross-Pitacvskii equation: 



.dtl^(x) 

I ; 

dt 



Vix) + gsmx)\^ - ^V^] i;{x) (1072) 



The potential V(x) corresponds to the of the DNLS equation, and the mass m is associated with the hopping am- 
plitude K using the standard prescription. In addition to that wc have the interaction parameter that corresponds 
to U multiplied by the volume of a site. It also can be related to the scattering length using the relation gs ~ ATras/m. 

Within the framework of the proper semiclassical treatment the quantum state is described as a distribution of points 
in phase space. The proper semiclassical treatment goes beyond the conventional mean field approximation. The 
latter assumes that in any time the state of the system looks like a coherent state. Such a state corresponds to a 
Gaussian-like distribution is phase space ("minimal wavepacket") and accordingly it is characterized by = (ui) and 
Tp^ = [(fi) or equivalently by the mean field 

"0; = (V'j) representing the center of a wavepaket (1073) 

To the extend that the mean field assumption can be trusted the equation of motion for ■i/;^ is the DNLS. Indeed 
if [/ = there is no non- linear spreading and the mean field description becomes exact. Otherwise it is a crude 
approximation. 



[39] Quasi ID network systems 

A ID segment can be regarded as the continuum limit of quasi OD chain. If the particle in confined between two 
ends, the model is know as "infinite well" or as "ID box". If there are periodic boundary conditions, the model is 
know as "ID ring". Several "ID boxes" can be connected into a network: each ID segment is called "bond", and the 
junctions arc called "vortices". 

The ends of an isolated bond can be regarded as hard walls. It is easily argued that the wavcfunction should satisfy 
Dirichlet boundary conditions at the boundary of a hard wall. The perturbation which is created by an infinitesimal 
displacement of the wall is finite (see section below). 

The coupling between two wavefunctions in a network due to the introduction of a connecting junction can be deduced 
by considering a wire which is divided by a high ud(x) barrier (see section below). 



174 



Wc characterize a network by specifying the lengths {La} of the bonds, and the the potentials u at each vertex. Given 
this information we would like to find the eigenfunctions and the eigenenergies. We describe below two procedures for 
that purpose: one is based on a straightforward generalization of the standard matching conditions, while the other 
is based on the scattering matrix formalism. 



[39.1] Hard walls 



Let us assume that we have a particle in a one dimensional box of length L, such that V{x) = within the interval 
< a; < L. We assume Dirichlet boundary conditions at x = 0, while at x = Lq we assume a potential step, such that 
V{x) = Uq for X > Lq. We are going to see that if we take the limit t/o — > oo, then this implies Dirichlet boundary 
conditions at x ^ Lq too. Then we are going to find the perturbation term that describes a dL displacement of the 
wall. 

The wavefunction of nth cigenstatc has to satisfy the Dirichlet boundary conditions at x = 0, and therefore has to be 
of the form 

,, > ( Asm(kx) tor < x < L 

^(^) = I Be-- for x>L (l^^^) 

where 



k = V2mE (1075) 
a = ^/2m{Uo - E) ^/2mUo 

The normalization factor at the limit Uq ^ oo is A = {2/Ly^^, and it would be convenient later to multiply the 
functions by (—1)", so as to have positive slope at x = L. The matching condition at x = L is: 



= -a (1076) 

x=L+0 



ip{x 

So the eigenvalue equation is 

kcot{kL)^-a (1077) 

In the limit Uq oo the equation becomes sm{kL) — which implies that the unperturbed eigen-energies are 
En = fc^j/2m with A:„ = {Tr/L)n. In this limit we see that the matching condition at x ~ L simply forces the wave- 
function to be zero there. Thus we have established that Hard walls implies Dirichlet boundary conditions. 

In order to find the perturbation due to a small shift dL is the position of the wall we keep Uq finite, though we 
assume it to be very very large. We shall take the strict limit C/q ^ oo only at the end of the calculation. For L = Lq 
the unperturbed Hamiltonian after digonalization is 

[Ho]™„ = (^n) S„m (1078) 

Zm \Lq / 

If we displace the wall a distance dL the new Hamiltonian becomes H = Hq + dLV. We ask what are the matrix 
elements Vnm of this perturbation. At first sight it looks as if to displace an "infinite wall" constitutes "infinite 
perturbation" and hence Vnm = oo. But in fact it is not like that. We shall see that 

Vnm = -prnm (1079) 

mLl 
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For the following calculation wc point out that for very large Uq the derivative of the nthe wavefunction at the wall is 
d 



I K (1080) 

x=L 1 ^ 



The Hamiltonian of the original system (before we displace the wall) is: 

Uo^-^ + Uiix) (1081) 
The Hamiltonian after we have displaced the wall a distance dL is: 

n^-^ + U2{x) (1082) 
So the perturbation is: 

SUix) = U2ix)-Ui{x) = dLxV (1083) 
which is a rectangle of width dL and height —Uq. It follows that the matrix elements of the perturbation are 
1 



dL 



L+dL 
L 



^(")(x)[-C/o]V^'"^(x)dx = -Uo^P^^Hl)^^^\l) = -Uo^ (^^^"^(^)) (^^^"^^^O 



where in the last step we have used the "matching conditions". Now there is no problem to take the limit oo 

= -^'2^0 (V?''") (/l^") = (^) = " ^'^^^^ 

It is easily verified that the diagonal terms VnndL of the perturbation matrix give correctly the first order shift dEn 
of the energy levels due to the dL displacement. 

^^^^= [39.2] The definition of a delta junction 



The simplest junction is composed oi A4 ~ 2 wires that are connected at one point. We can model such a junction as 
a delta barrier V{x) = ud{x). The matching condition at x = is implied by the Schrodinger equation, and relates 
the jump in the derivative to the value of the wavefuntion at that point. 

-5- [dtpi+O) - aV'(-O)] = u ^/.(O) (1085) 
2m 

A more elegant way of writing this relation is 

= MmmP{0) (1086) 



E dipa 
dr 

a=l 



r=0 



with M ~ 2. The derivative of the radial functions ipii''') = and '4'2{t) = '0(+r) is with respect to the radial 

coordinate r = \x\. It is implicit that these wavefuntions should have the same value at the meeting point r ~ 0. If we 
have M wires connected at one point we can define a generalized "delta junction" using the same matching condition. 
Say that we have a = 1,2,3 leads. The matching conditions at the junction are V'i(O) = V'2(0) = V'3(0) = V'(O) while 



176 



the sum over the radial derivatives should equal A4mmlj{0). This means that the sum over the outgoing currents is 
zero. 

More generally a junction can be fully characterized by its S matrix. So we can ask what is the S matrix of a delta 
junction. It is straightforward to find out that the S matrix which is implied by the above matching conditions is 



Sab 



1 



[such that S=l for u=oo\ 



M \1 + i{u/vE) 
For u = we get zero reflection if = 2, while if = 3 we get 



(1087) 



^+1/3 -2/3 -2/3^ 
S= { -2/3 +1/3 -2/3 
-2/3 -2/3 +1/3, 



(1088) 



In the limit ^ oo we get total reflection. 

Sometimes we want to treat the connecting junction as a perturbation. Consider for example the simplest possibility 
of having two ID boxes connected at a; = hence forming a double well structure which is described by a Hamiltonian 
Ti.{x,p]u) in which the barrier is represented by V{x) ~ uS{x). The eigenstates n of the unperturbed Hamiltonian 
7i(a;,p;oo) are those of the left box together with those of the right box. We want to have an explicit expression 
for the perturbation Wnm due to the couphng at x = 0. Note that W is defined as the difference H{u) — H{oo) so 
it is not the same as V = Ti{u) — H{0). We can obtain the total perturbation W from a sequence of infinitesimal 
variations of the barrier height starting from u — oo. Namely, 



niu) 



■H{oo) 



m 

du 



du' 



■H{oo) + W 



(1089) 



For any value of u the Hilbert space of the system is spanned by a set of (real) eigenfunction labeled by n. The matrix 
elements for an infinitesimal variation of the barrier height is 



dU 

du 



- ,/,(") 



^(n)(0) ^(™)(0) = 



1 



(2mu)2 









. dr 





. dr . 



(1090) 



where is the last step we have used the matching conditions in order to express the wave function by the total radial 
derivative. For a large barrier with small transmission the nth and the mth states remain similar to the unperturbed 
states and accordingly, upon integration, we get the result [http://arxiv.org/abs/0807.2572]: 



1 










. dr 





. dr . 



(1091) 



The d/ dr in this expression is the total radial derivative, but in fact the contribution comes from one term only, 
because the unperturbed wavefunction of a given eigenstate ■f/'^"-' is non-zero only in one box. 



== [39.3] Finding the eigenstates of a network 

Consider a network which is composed of b bonds and v vertexes. The wavefunction on bond a = that connects 
vertex i to vertex j is written as 

M^) = Bac'"^^ + Aae-'''^^ (1092) 

where < a; < Lq is the position of the particle along the bond with the origin at vertex i. Note that the wavefunction 

on a = (j, i) is with Aa = BaC*'"""^" and Ba = Aae~^^^^°- ■ The wavenumber is ka = \j2mE + [(\)alLa)^ where is 
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determined by the vector potential. Note that (f>a = —4>a- For simplicity we assume below that there is no magnetic 
field, and accordingly fcg = fc^. 

The most economical way to represent the wavefunction is by a vector ip i-^ {'0i,'02, ...,'01,} that contains the wave 
amplitudes at the vertexes. If we know this vector then we can construct the whole wavefunction on bond a = 
using the formula 

tpa{x) = ■ r \ 'ipiSin{ka{La-x)) + 'il}jSm{kax) (1093) 
sm(fcaLQ) L J 

The in order to find the eigenstates we simply have to write the v matching conditions of the vertexes, and look for 
the energies E for which there is non-trivial solution for this homogeneous set of equations. 

Optionally the wavefunction can be represented by the vector A = {Aa} of length 26, or equivalently by the vector 
B = {Ba}. The wavefunction and its gradient in a particular vertex are related to the latter as follows: 

ij = B + A (1094) 
(9V = ik X {B- A) (1095) 



If the junctions are represented by a scattering matrix and not by a simple matching condition there is an optional 
procedure for finding the eigenstates. The A and the B vectors are related by 



B = S A 
A ^ J c 



kL 



B 



(1096) 
(1097) 



where S' is a 26 x 26 matrix that relates the outgoing to the ingoing fluxes, and J is a 26 x 26 permutation matrix 
that induces the mapping a s- a, and L = diag{La}, and k = diaglfca}. The equation for the eigenstates is 



(Je 



ikL ( 



DA 







(1098) 



We can get from this equation a set of eigenvalues En with the corresponding eigenvectors A*^") and the associated 
amplitudes = 5A("). 



The S matrix of the network (if appropriately ordered) has a block structure, and can be written as S' = , 
where is the Vj x Vj block that describes the scattering in the jth vertex. Vj is the number of leads that stretch out 
of that vertex. A delta function scatterer is regarded as a Vj = 2 vertex. Let us construct a simple example. Consider 
a ring with two delta barriers. Such ring can be regarded as a network with two bonds. The bonds are labeled as 
12a, 12b, 21a, 21b, where A and B distinguish the two arms. The matrices that define the system are 



/n 


ti 








ti 


ri 














r2 


t2 


Vo 





t2 


r2 



(La 

Lb 

La 

V ^ Lbi 



J = 



/O 1 0\ 

1 

10 

Vo 1 0/ 



(1099) 



Finally we note that k = y/2mE + 4>/L, where the fluxes matrix is = diag{(/)a}. If there is a single flux line cf) one 
can write (p = (j)P, where P can be expressed as a linear combination of the channel projectors Pa- For example, if 
only one wire a encloses the flux line, then P = Pa — Pa and we get 

k = lV2mE + <j)^ (1100) 

h L 

The obvious application of this procedure is for the analysis of a multi-mode Aharonov-Bohm ring, where the "bonds" 
are the propagation-modes of the ring, and they all enclose the same flux line. 
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QM in Practice (part II) 



[40] Approximation methods for finding eigenstates 



[40.1] The WKB approximation 



In the next sections we discuss the canonical version of perturbation theory. The smaU parameter is the strength 
of the perturbation. Another family of methods to find approximate expressions for the eigen-functions is based on 
treating as the small parameter. The ft in this context is not the ft of plank but rather its scaled dimensionless 
version that controls quantum-to-classical correspondence. For a particle in a box the scaled ft is the ratio between the 
De-Broglie wavelength and the linear size of the box. Such approximation methods arc known as semi-classical. The 
most elementary example is known as the WKB (Wentzel, Kramers, Brillouin) approximation [Messiah p. 231]. It is 
designed to treat slowly varying potentials in ID where the wavefunction looks locally like a plane wave (if V{x) < E) 
or as a decaying exponential (in regions where V{x) > E). The refined version of WKB, which is known as "uniform 
approximation", allows also to do the matching at the turning points (where V{x) ^ E). The generalization of the 
d = 1 WKB to d > 1 dimensions integrable systems is known as the EBK scheme. There is also a different type of 
generalization to d > 1 chaotic systems. 

Assuming free wave propagation, hence neglecting back reflection, the WKB wavefunction is written as 



(1101) 



This expression is inserted into the ID Schrodinger equation. In leading order in ft we get a continuity equation for 
the probability density p{x)^ while the local wavenumber should be as expected 



dS(x) 
dx 



7{x) = y'2m{E -V{x)) 



(1102) 



Hence (for a right moving wave) one obtains the WKB approximation 



ip{x) 



zEXP 



i / p{x')dx 



(1103) 



where xq is an arbitrary point. For a standing wave the "EXP" can be replaced by either "sin" or "cos". It should 
be clear that for a "fiat" potential fioor this expression becomes exact. Similarly in the "forbidden" region we have a 
decaying exponential. Namely, the local wavenumber ±p(x) is replaced by ±iQ;(x), where a{x) = {2m{V{x) — i?))^/^. 

If we have a scattering problem in one dimension we can use the WKB expression (with exp) in order to describe 
(say) a right moving wave. It should be realized that there is no back-reflection within the WKB framework, but still 
we can calculate the phase shift for the forward scattering: 



p{x)dx 



psdx 



V2mE 



1-^-1 



E 



dx 



^y2m{E ~ V{x)) - V2mE 
V{x)dx 



dx 



(1104) 



Hence we get 



1 f°° 
-7— / Vix)da 

n-VE J-oo 



(1105) 
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It should be noted that 9^^^ is the phase shift in a ID scattering geometry {—oo < x < oo). A similar looking result 
for semi-ID geometry (r > 0) is known as the Born approximation for the phase shift. 

If we have a particle in a well, then there are two turning points xi and X2- On the outer sides of the well we have 
WKB decaying exponentials, while in the middle we have a WKB standing wave. As mentioned above the WKB 
scheme can be extended so as to provide matching conditions at the two turning points. Both matching conditions 
can be satisfied simultaneously if 

p{x)dx ^ {]■ + n)TTh (1106) 

where n = 0, 1, 2, . . . is an integer. Apart from the 1/2 this is a straightforward generalization of the quantization 
condition of the wavenumber of a particle in a ID box with hard walls {k x (x2 — xi) = mi). The (l/2)7r phase shift 
arise because we assume soft rather than hard walls. This 1/2 becomes exact in the case of harmonic oscillator. 

The WKB quantization condition has an obvious phase space representation, and it coincides with the " Born Oppen- 
heimer quantization condition" : 

p{x)dx ^ {^+n)2'Kh (1107) 

The integral is taken along the energy contour which is formed by the curves p = ±p(x). This expression implies that 
the number of states up to energy E is 

NiE) ^ (1108) 

JJh(x,p)<e '^^n. 

The d > 1 generalization of this idea is the statement that the number of states up to energy E is equal to the phase 
space volume divided by {2-KhY. The latter statement is known as Weyl law, and best derived using the Wigner-Weyl 
formalism. 



^=^= [40.2] The variational scheme 

The variational scheme is an approximation method that is frequently used either as an alternative or in combination 
with perturbation theory. It is an extremely powerful method for the purpose of finding the ground-state. More 
generally we can use it to find the lowest energy state within a subspace of states. 

The variational scheme is based on the trivial observation that the ground state minimize the energy functional 



F[^] = (1109) 

If we consider in the variational scheme the most general ij: we simply recover the equation Titl) = E-ip and hence 
gain nothing. But in practice we can substitute into F[] a trial function -0 that depends on a set of parameters 
X = {Xi, X2, ...). Then we minimize the function F{X) = F[ip] with respect to X. 

The simplest example is to find the ground state on an harmonic oscillator. If we take the trail function as a Gaussian 
of width (T, then the minimization of the energy functional with respect to a will give the exact ground state. If we 
consider an anharmonic oscillator we still can get a very good approximation. A less trivial example is to find bonding 
orbitals in a molecules using as a trial function a combination of hydrogen-like orbitals. 
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== [40.3] Perturbation theory - motivation 

Let us consider a particle in a two dimensional box. On the left a rectangular box, and on the right a chaotic box. 




For a regular box (with straight walls) we found: En^.n^ oc {ux/ L^Y + [f^y/LyY so if we change we get the energy 
level scheme which is drawn in the left panel of the following figure. But if we consider a chaotic box, we shall get 
energy level scheme as on the right panel. 




The spectrum is a function of a control parameter, which in the example above is the position of a wall. For generality 
let us call this parameter X. The Hamiltonian is 7i(Q, P; X). Let us assume that we have calculated the levels either 
analytically or numerically for X = Xq. Next we change the control parameter to the value X = Xo + A. Possibly, if 
5X is small enough, we can linearize the Hamiltonian as follows: 

n = H(Q,P;Xo) + XV{Q,P) = Ho + XV (1110) 

With or without this approximation we can try to calculate the new energy levels. But if we do not want or cannot 
diagonalize Ti. for the new value of X we can try to use a perturbation theory scheme. Obviously this scheme will 
work only for small enough A that do not "mix" the levels too much. There is some "radius of convergence" (in A) 
beyond which perturbation theory fails completely. In many practical cases X = Xq is taken as the value for which 
the Hamiltonian is simple and can be handled analytically. In atomic physics the control parameter X is usually 
either the prefactor of the spin-orbit term, or an electric field or a magnetic field which are turned on. 

There is another context in which perturbation theory is very useful. Given (say) a chaotic system, we would like to 
predict its response to a small change in one of its parameters. For example we may ask what is the response of the 
system to an external driving by either an electric or a magnetic field. This response is characterized by a quantity 
called "susceptibility". In such case, finding the energies without the perturbation is not an easy task (it is actually 
impossible analytically, so if one insists heavy numerics must be used). Instead of "solving" for the eigenstates it 
turns out that for any practical purpose it is enough to characterize the spectrum and the eigenstates in a statistical 
way. Then we can use perturbation theory in order to calculate the "susceptibility" of the system. 
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== [40.4] Perturbation theory - a mathematical digression 

Let us illustrate how the procedure of perturbation theory is applied in order to find the roots of a toy equation. 
Later we shall apply the same procedure to find the eigenvalues and the eigenstates of a given Hamiltonian. The toy 
equation that we consider is 

x + Xx^=3 (1111) 
We assume that the magnitude of the perturbation (A) is small. The Taylor expansion of x with respect to A is: 

x(A) = + a;(i)A + x^^h^ + x^^'^X^ + ... (1112) 
The zero-order solution gives us the solution for the case A = 0: 

x'-°^=3 (1113) 
For the first-order solution we substitute a;(A) = x'*'-' + .t'^'A in the equation, and get: 

[x'"' + xti'A] + A[a;(°) + x*-^^ X]^ = 3 (1114) 
- 3] -f A[x(i) + (x("))5] + 0(A2) = 

By comparing coefScients we get: 

a;(°)-3==0 (1115) 
^(1) + (a;(0))5 ^ 

Therefore: 

^(1) = _(^(0))5 ^ _35 (^^^g) 

For the second-order solution, we substitute x{X) = x^"^ + x'^'A -I- x'^^-'A^ in the equation, and get: 

[^(0) + ^Wx + :j(2)a2] + A[a;W + x^^^X + x^^^X^]" = 3 (1117) 
Once again, after comparing coefficients (of A^), we get: 

= -5(.T("')4a;(i) = 5 X 3^ (1118) 
It is obviously possible to find the corrections for higher orders by continuing in the same way. 
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[41] Perturbation theory for the eigenstates 
== [41.1] Degenerate perturbation theory (zero-order) 

We will take a general Hamiltonian after diagonalization, and add a small perturbation that spoils the diagonalization: 
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(1119) 



The Hamiltonian can be visualized using an energy level diagram: 
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The eigenvectors without the perturbation are: 
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(1120) 



The perturbation spoils the diagonalization. The question we would like to answer is what are the new eigenvalues 
and eigenstates of the Hamiltonian. We would like to find them "approximately", without having to diagonalize the 
Hamiltonian again. First we will take care of the degenerated blocks. The perturbation can remove the existing 
degeneracy. In the above example we make the following diagonalization: 

/l 0\ /O 1 0\ /1.97 0\ 

2-0 1 + 0.03 -1 ^ 2.03 (1121) 

yooiy yoooy \ o 02/ 




We see that the perturbation has removed the degeneracy. At this stage our achievement is that there are no matrix 
elements that couple degenerate states. This is essential for the next steps: we want to ensure that the perturbative 
calculation would not diverge. 



183 



For the next stage we have to transform the Hamihonian to the new basis. See the calculation in the Mathematica 
file "diagonaUze.nb" . If we diagonahzc numerically the new matrix we find that the eigenvector that corresponds to 
the eigenvalue E w 5.003 is 
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(1122) 



We note that within the scheme of perturbation theory it is convenient to normalize the eigenvectors according to the 
zero order approximation. We also use the convention that all the higher order corrections have zero overlap with the 
zero order solution. Else the scheme of the solution becomes ill defined. 



== [41.2] Perturbation theory to arbitrary order 

We write the Hamiltonian as 7i = Hq + XV where V is the perturbation and A is the control parameter. Note that 
A can be "swallowed" in V. We keep it during the derivation in order to have clear indication for the "order" of the 
terms in the expansion. The Hamiltonian is represented in the unperturbed basis as follows: 
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A^|n)K 



(1123) 



which means 
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In fact we can assume without loss of generality that Vn^m = for n ~ m, because these terms can be swallowed into 
the diagonal part. Most importantly we assume that none of the matrix element couples degenerated states. Such 
couplings should be treated in the preliminary "zero order" step that has been discussed in the previous section. 

We would like to introduce a pcrturbativc scheme for finding the eigenvalues and the eigcnstates of the equation 

(Ho + XV)\-^) ^ El'^) (1125) 
The eigenvalues and the eigenvectors are expanded as follows: 

E = E^°^ + XE^^^ + A^^I^] _^ . . . 1^1126) 
^„ *[ol + A*W + A2*[21 



where it is implicit that the zero order solution and the normalization are such that 

- ,5 

^[1,2,3,...] ^ Q f-Q^. „ = „^ 

It is more illuminating to rewrite the expansion of the eigenvector using Dirac notations: 



(1127) 



K) = |n["l)+A|nW)+A2|4^1) + 



(1128) 
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hence 

(,^["1 \no) = (nM + A(7^["l |nW) + X'(n^'^ |n{fl) + • • • (1129) 

which coincides with the traditional notation. In the next section we introduce a derivation that leads to the following 
practical results: 

*L"' = Kno (1130) 



El"] 



'"0 



J-^ — ''no, no 



The calculation can be illustrated graphically using a "Feynman diagram". For the calculation of the second order 
correction to the energy we should sum all the paths that begin with the state uq and also end with the state uq. 
We see that the influence of the nearer levels is much greater than the far ones. This clarifies why we cared to treat 
the couplings between degenerated levels in the zero order stage of the calculation. The closer the level the stronger 
the influence. This influence is described as "level repulsion". Note that in the absence of first order correction the 
ground state level always shifts down. 



[41.3] Derivation of the results 



The equation we would like to solve is 
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*2 +^ *2 =S *2 I (1131) 



Or. in index notation: 

£n^n + A^K.m*™ = (1132) 

m 

This can be rewritten as 

(£;-£„)*„ = A ^K.m^-m (1133) 

We substitute the Taylor expansion: 

£; = ^A'=£;W^ = S["1+A£W+... (1134) 
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We recall that = e„„, and 
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/•••\ 




1 




v.. 7 



/■■■\ 

? 

? 



? 

? 

v.'.y 



After substitution of the expansion we use on the left side the identity 



(ao + Aai + A^aa + . . . )(&o + A6i + A^^z + •••) = X! '^'^ X! ^k'bk^k 

k fe'=0 



Comparing the coefficients of A*^ we get a system of equations k — 1, 2, 3.. 



rffe'l,T,ffe-fc 

fc'=0 



-1] 



We write the kth equation in a more expanded way: 



If we substitute n = uq in this equation we get: 



+ =Xl4o,m«' 



If we substitute n ^ uq in this equation we get: 



m fc' = l 



[fe-fe'] 



Now we see that we can solve the system of equations that we got in the following order: 

^[0] ^ ^[1] ^ ^[1] ^ ^[2] ^ 



Where: 



k'=l 



The practical results that were cited in the previous sections are easily obtained from this iteration 
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[42] Perturbation theory / Wigner 



[42.1] The overlap between the old and the new states 



We have found that the perturbed eigenstates to first-order are given by the expression 

K)-|n[°l)+E7%-|"™) (1143) 

^ trio t„ 

So, it is possible to write: 

« for 71 7^ no (1144) 

which imphcs 

P{n\m) = |(n[0l|m)P « ,^'^""1! forn ^ no (1145) 

In the latter expression we have replaced in the denominator the unperturbed energies by the perturbed energies. 
This is OK because in this level of approximation the same result is obtained. In order for our perturbation theory 
to be valid we demand 

|y|<A (1146) 

In other words, the perturbation must be much smaller than the mean level spacing. We observe that once this 
condition breaks down the sum X]m-^("'l'^) becomes much larger then one, whereas the exact value is exactly one 
due to normalization. This means that if 3> A the above first order expression cannot be trusted. 

Can we do better? In principle we have to go to higher orders of perturbation theory, which might be very complicated. 
But in fact the generic result that comes out is quite simple: 

Pin\m) « , , (1147) 

This is called "Wigner Lorentzian". As we shall see later, it is related to the exponential decay law that is also named 
after Wigner ("Wigner decay"). The expression for the "width" of this Lorentzian is implied by normalization: 

r = J|yp (1148) 

The Lorentzian expression is not exact. It is implicit that we assume a dense spectrum (high density of states). We 
also assume that all the matrix elements are of the same order of magnitude. Such assumption can be justified e.g. 
in case of chaotic systems. In order to show that -^'("■1"^) = 1 one use the recipe: 



where A is the mean level spacing. In the following we shall discuss further the notion Density of States (DOS) and 
Local Density of States (LDOS) which arc helpful in further clarifying the significance of the Wigner Lorentzian. 
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= [42.2] The DOS and the LDOS 

When we have a dense spectrum, we can characterize it with a density of states (DOS) function: 



(1150) 



We notice that according to this definition: 



E+dE 



Q{E')dE' = number of states with energy E < En < E + dE 



(1151) 



If the mean level spacing A is approximately constant within some energy interval then g{E) ~ 1/ A. 

The local density of states (LDOS) is a weighted version of the DOS. Each level has a weight which is proportional 
to its overlap with a reference state: 



piE) = J2mn)\'SiE^En) 



(1152) 



The index n labels as before the eigenstates of the Hamiltonian, while ^ is the reference state. In particular can be 
one of the eigenstates of the unperturbed Hamiltonian. In such case the Wigner Lorcntzian approximation implies 



piE) 



1 



(r/2) 



niE-En„y- + {T/2)^ 
It should be clear that by definition we have 

/oo 
p{E)dE^Y.\('^\^)\'-^ 
-OO 



(1153) 



(1154) 




==^ [42.3] Wigner decay and its connection to the LDOS 

Let us assume that we have a system with many energy states. We prepare the system in the state \^!). Now we 
apply a field for a certain amount of time, and then turn it off. What is the probability P{t) that the system will 
remain in the same state? This probability is called the survival probability. By definition: 



P(t) = |(*(0)|*(t))p 



(1155) 
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Let Ho be the unperturbed Hamiltonian, while H is the perturbed Hamiltonian (while the field is "on"). In what 
follows the index n labels the eigenstates of the perturbed Hamiltonian H. We would like to calculate the survival 
amplitude: 

(*(0)|vl/(t)) = imim) = 5](n|*)pe-^"* (1156) 



We notice that: 



(«'(0)|^'(i)) = FT 



FT [2ttp{u 



(1157) 



If we assume that the LDOS is given by Wigner Lorentzian then: 



Pit) = FT[2np{E)] 



-rt 



(1158) 



Below we remind ourselves of the customary way to perform the Fourier transform in this course; 



fit) = 



/(t)e*"*di 
cLu 



(1159) 



2tt 



F(w)e- 



The Wigner decay appears when we "break" first-order perturbation theory. The perturbation should be strong 
enough to create transitions to other levels. Else the system stays essentially at the same level all the time {P{t) w 1). 
Note the analogy with the analysis of the dynamics in a two level system. Also there, in order to have for P{t) 
large amplitude oscillations, the hopping amplitude should be significantly larger compared with the on site energy 
difference. 
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[43] Decay into a continuum 
== [43.1] Definition of the model 



In the problem of a particle in a two site system, we saw that the particle oscillates between the two sites. We will 
now solve a more complicated problem, where there is one site on one side of the barrier, and on the other side there 
is a very large number of energy levels (a "continuum"). 




We will see that the particle "decays" into the continuum. In the two site problem, the Hamiltonian was: 



n = 



Eo a 
a El 



(1160) 



Where a is the transition amplitude through the barrier. In the new problem, the Hamiltonian is: 



n 



/Eo 


a 


a 


a 


a 


El 








a 





E2 





a 








E3 


\ • 









(1161) 



We assume a mean level spacing A between the continuum states. If the continuum states are states of a one- 
dimensional box with length L, then the quantization of the momentum is tt/X, and from the relation between the 
energy and the momentum (dE — VEdp) we get: 



A = v,- 

From Fermi golden rule we expect to find the decay constant 

r = 2TT^a^ 
A 

Below we shall see that this result is exact. We are going to solve both the eigenstate equation 
and the time dependent Schrodinger's equation 



(1162) 



dt 



(1163) 



(1164) 



(1165) 
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where the perturbation term V includes the coupHng elements a. 



[43.2] An exact solution for the eigenstates 



The unperturbed basis is |0), |fc) with energies Eq and Ek- The level spacing of the quasi continuum states is A. The 
couplings between the discrete state and the quasi continuum states all equal a. The system of equations for the 
eigenstates \n) is 



fe 

Sfe^-fc + a^o - E^k (1167) 
From the second line we get how the eigenstates look like: 

where 'I'o should be determined by normalization, while the eigenenergies E = En are determined by the equation 
E^-^-^o (1169) 

For equal spacing this can be written as 

cotfn^) =-^{E-En) (1170) 

leading to the solution 

En^(n+^^„^A (1171) 

where f changes monotonically from tt to 0. Now we can determine the normalization, leading to 

^0= , = (1172) 

^{En - EoY + (r/2)2 

where 



This is the Wigner Lorentzian. It gives the overlap of the eigenstates with the unperturbed discrete state. A related 
result is due to Fano. Assuming that we have an initial state \F) that has coupling Vl to the state |0) and a coupling 
ap to the states \k). Then we get for the decay amplitudes: 

The decay amplitudes to levels En that are far away from the resonance is ap as in the unperturbed problem. On 
the other hand, in the vicinity of the resonance we have levels to which the decay is suppressed due to destructive 
interference. 
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[43.3] An exact solution of the time dependent problem 



We switch to the interaction picture: 



iE„t 



We distinguish between cu and cq, and get the system of equations: 



dt 

. dck 
' dt 



From the second equation we get: 



Ck{t) - - z / e^(^'=-^°)*>,,oCo(i')rfi' 
Jo 



By substituting into the first equation we get: 



dt 



C(t-t')co{t')dt' 



Where: 



C(t-0 = J2\v,,o 



|2g-i(-Efc-Bo)(t-t') 



(1175) 



(1176) 



(1177) 



(1178) 



(1179) 



The Fourier transform of this fmiction is: 



(1180) 



Which means that: 



C{t - t') 



27r 



5{t-t') = TS{t-t') 



(1181) 



We notice that the time integration only "catches" half of the area of this function. Therefore, the equation for cq is: 

(1182) 

This leads us to the solution: 



dco r 



Pit) = \co{t)\' = 



(1183) 
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= [43.4] The Gamow Formula 

A particle of mass m in ID is confined from the left by an infinite potential wall and from the right by a delta function 
U{x) = u5{x — a). We assume large u such that the two regions are weakly coupled. In this section we shall derive 
an expression for the decay constant F. We shall prove that it equals the "attempt frequency" multiplied by the 
transmission of the barrier. This is called Gamow formula. 



V 




x=0 x=a X 



We will look for complex energy solutions that satisfy "outgoing wave" boundary conditions. Setting k = kr — ijr we 
write the energy as: 



r, 

^^2;^ = ^^-^T 



(1184) 



m 

In the well the most general solution of Till) ~ Eip is ip{x) = Ac^'^'-^ + Bc^^^^ . Taking the boundary condition ■0(0) = 
at the hard wall, and the outgoing wave boundary condition at infinity we get 



^^{x) = C sm.(kx) 



for < a; < a 



(1185) 



il){x) = De'"^ for X > a 
The matching conditions across the barrier are: 



?/'(a + 0) -i/;(a-0) 
?A'(a + 0)-V'(a-0) = 2aV(a) 



(1186) 



where 



a = m 



(1187) 



Thus at x a the wave functions must fulfill: 



+ '0 



2a 



(1188) 



leading to 



ik — k cot{ka) — 2a 



(1189) 
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We can write last equation as: 

_k_ 

tan(fca) = (1190) 

1 - iir- 

The zero order solution in the coupling (u = oo) are the energies of an isolated well corresponding to 

k = kn = -n (1191) 
a 

We assume small coupling c and expand both sides of the equation around fc„ . Namely we set k = (fc„ + Sk) — ij 
where Sk and 7 are small corrections to the unperturbed energy of the isolated state. To leading order we get: 

h / A- ^ ^ 



«'^fc-*«7 = -^-*(^^) (1192) 

From the last equation we get: 

1 k 

kr = k„ +5k = kn - - tP- (1193) 
a 2a 

a\2a) 

From here we can calculate both the shift and the "width" of the energy. To write the result in a more attractive way 
we recall that the transmission of the delta barrier at the energy E = En is 



l + (a//c)2 \a 

hence 



1 f kn\^ VE 



(1194) 



r. « 2vE- ^ f'g (1195) 

a \ 2a J 2a 



This is called Gamow formula. It reflects the following semiclassical picture: The particle oscillates with velocity 
ve = ^j2Elvn inside the well, hence VE/{2a) is the number of collisions that it has with the barrier per unit time. 
The Gamow formula expresses the decay rate as a product of this "attempt frequency" with the transmission of the 
barrier. It is easy to show that the assumption of weak coupling can be written as (7 <C 1. 
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== [43.5] From Gamow to the double well problem 

Assume a double well which is divided by the same delta function as in the Gamow decay problem. Let us use the 
solution of the Gamow decay problem in order to deduce the oscillation frequency in the double well. 




Going back to the Gamow problem we know that by Fermi golden rule the decay rate is 

r = ^|14fc|' (1196) 

where Vnk is the probability amplitude per unit time to make a transitions form level n inside the well to any of the 
k states outside of the well. It is assumed that the region outside of the well is very large, namely it has some length 
L much larger than a. The k states form a quasi-continuum with mean level spacing A^. This expression should be 
compared with the Gamow formula, which we write as 

r - - i^.A„ (1197) 

where g is the transmission of the barrier, and is the mean level spacing in the small (length a) region. The 
Gamow formula should agree with the Fermi golden rule. Hence we deduce that the transition amplitude is 

1 



|l/„fc| = — VffAaAi (1198) 

With some further argumentation we deduce that the coupling between two wavcfunctions at the point of a delta 
junction is: 

Vnk = -j^[d^^''^[d^^''^] (1199) 

where dip is the radial derivative at the point of the junction. This formula works also if both functions are on the 
same side of the barrier. A direct derivation of this result is also possible but requires extra sophistication. 

Now we can come back to the double well problem. For simplicity assume a symmetric double well. In the two level 
approximation n and k are "left" and "right" states with the same unperturbed energy. Due to the coupling we have 
coherent Bloch oscillation whose frequency is 

n = 2\Vnk\ = -VqA^ - —V9 (1200) 
TT a 
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[44] Scattering resonances 

== [44.1] Fabry Perrot interference / transmission resonance 

The Fabry Perrot problem is to find the transmission of a double barrier given the transmission of each barrier and 
their "optical distance" : (i.e. the distance in wavelength units which determines phase difference). For clarity we may 
assume that the barriers are represented by delta functions. The conventional way of solving this problem is to match 
together the solutions in the three segments. Another way of solving this problem is to "sum over paths", similar to 
the way one solves the interference problem in the two slit geometry. 



Wc take as given the transmission coefficient T = |tp. and the reflection coefficient R = |rp = 1 — T. If the distance 
between the two barriers is a, then a wave accumulates a phase = kL when going from the one barrier to the other. 
The transmission of both barriers together is: 

transmission = \t x e*"^ x (1 + {re''t'f + {re"^)^ + . . . ) x (1201) 

Every round trip between the barriers includes two reflections, so the wave accumulates a phase (re"^)^. We have a 
geometrical series, and its sum is: 



transmission = 



t X 



1 - (re'"^)^ 



X t 



(1202) 



After some algebra we find the Fabry Perrot expression: 

1 

transmission = ; — j—rr-, , ,,„ (1203) 

l + 4[i?/r^](sin((/)))2 ^ ' 

We notice that this is a very "dramatic" result. If we have two barriers that are almost absolutely opaque R ^ 1, 
then as expected for most energies we get a very low transmission of the order of magnitude of T^. But there are 
energies for which = tt x integer and then we find that the total transmission is 100%! In the following figure we 
compare the two slit interference pattern (left) to the Fabry Perrot result (right): 
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[44.2] Scattering on a single level system 



The most elementary example for a resonance is when the scattering region contains only one level. This can be 
regarded as a simplified version of the Fabry-Pcrrot double barrier problem, or it can be regarded as simplified version 
of scattering on a shielded well which we discuss later on. 

We assume, as in the Wigner decay problem that we had been analyzed in a previous lecture, that the scattering 
region contains a single level of energy Eq. Due to the coupling with the continuum states of the lead, this level 
acquire withs Tq. This width would determine the Wigner decay rate if initially the particle were prepared in this 
level. But we would like to explore a different scenario in which the particle is scattered from the outside. Then we 
shall see that Tq determines the Wigner delay time of the scattering. 

In order to perform the analysis using the a formal scattering approach we need a better characterization of the lead 
states \k). For simplicity of presentation it is best to imagine the leas as a ID segment of length L (later we take L to 
be infinite). The density of states in the energy range of interest is L/{ttve)- The coupling between the Eq state and 
any of the volume-normalized k states is Vo.fc = ly/VX. The coupling parameter w can be calculated if the form the 
barrier or its transmission are known (see appropriate "QM is practice I" sections). Consequently we get the the FGR 
width of the level Eq is Fq = [2/ve)w'^. Using a formal language it means that the resolvent of the Eq subsystem is 

^(^^ = F F ! vr (1204) 
E - Eq + i[Vq/2) 

In order to find the S matrix, the clement (s) of the T matrix should be calculated in a properly normalized basis. 
The relation between the fiux-normalized states and the volume-normalized states is: 

^ ^e-'*^^ - -^e+'^'-' (1205) 

Consequently we get for the 1x1 scattering matrix the expected result 

S{E) = \~iT = \-zVk^,QG{E)VQ,k, = \-i (1206) 

E - Eq+ 2(Fo/2) 

which implies a phase shift 

ro/2 




i5n = arctan 



E — E( 

and accordingly the time delay is 



(1207) 



"° = {E-EQf\{TQ/2f ^'^'^^ 
Note that at resonance the time delay is of order I/Fq. 

The procedure is easily generalized in order to handle several leads, say two leads as in the double barrier problem. 
Now we have to use an index a = 1, 2 in order to distingush the left and right channels. The width of the Eq levels is 
Fq = {2/vE)[wf + The freewaves solution is the leads are labled as |(/)^'"), and the S matrix comes out 



SabiE) = Sa.b - iTab = Sa,b - 



E-Eq + i{Tq/2) 



2WaWb 



(1209) 



We see that the maximum transmission is for scattering with E = Eq, namely \2waWi,/{w'f + t«2)p7 which becomes 
100% transmission if wi — W2 as expected from the Fabry-Pcrrot analysis. 
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== [44.3] Scattering resonance of a shielded ID well 

A shielded ID well is defined by 

V{r) = VeiR-r) + US{r^R) (1210) 

where V is the potential floor inside the scattering region Q < r < R, and [/ is a shielding potential barrier at the 
boundary r = i? of the scattering region. We add the shield in order to have distinct narrow resonances. The interior 
wave function is 



ip{r) = SIN(ar) (1211) 

where a — ^2m\E — V\. and SIN is either sin or sinh depending on whether E is larger or smaller than V . Taking 
the logarithmic derivative at r i? and realizing that the effect of the shield \J is simply to boost the result we get: 



1 dil}{T) 



ip{r) dr 



= aCTG{aR) + 2mU (1212) 

=R+0 



where CTG is either cot or coth depending on the energy E. It should be realized that fco depends on the energy as 
well as on V and on U. For some of the discussions below, and also for some experimental application it is convenient 
to regard E as fixed (for example it can be the Fermi energy of an electron in a metal), while V is assumed to be 
controlled (say by some gate voltage). The dependence of kg on V is illustrated in the following figure. At V = E 
there is a smooth crossover from the "cot" region to the "coth" region. If V is very large then fco = oo. This means 
that the wavefunction has to satisfy Dirichlet (zero) boundary conditions at r = i?. This case is called "hard sphere 
scattering" : the particle cannot penetrate the scattering region. If U is very large then still for most values of V we 
have "hard sphere scattering" . But in the latter case there are narrow strips where fcg has a wild variation, and it 
can become very small or negative. This happens whenever the CTG term becomes negative and large enough in 
absolute value to compensate the positive shield term. We refer to these strips as "resonances" . The locations V ^ Vr 
of the resonances is determined by the equation tan(ai?) ~ 0. We realize that this would be the condition for having 
a bound state inside the scattering region if the shield were of infinite height. 




In order to find the phase shift we use the standard matching procedure. In the vicinity of a resonance we can linearize 
the logarithmic derivative with respect to the control parameter V as ko{V) (V — Vr)/vr or if V is fixed, then with 
respect to the energy E as ko{E) w —{E'~Er)/vr, where the definitions of either Vr or Er and Vr arc implied by the 
linearization procedure. The two procedures arc equivalent but for clarity we prefer the former. Thus in the vicinity 
of the resonance we get for the the phase shift 



Sf) = Sn 



arctan 



ko{V) 



= — fcfii? + arctan 



r./2 



V-Vr 



(1213) 



where F^. = 2vrkE- The approximation above assumes well separated resonances. The distance between the locations 
Vr of the resonances is simply the distance between the metastable states of the well. Let us call this level spacing Aq. 
The condition for having a narrow resonance is F^ < Ag. By inspection of the plot it should be clear that shielding 
(large U) shifts the plot upwards, and consequently Vr and hence F^ become smaller. Thus by making U large enough 
we can ensure the validity of the above approximation. 
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In order to get the Wigner time delay we regard V as fixed, and plot the variation of ko{E) as a function of E. This 
plot looks locally the same as the ko{V) plot, with E —V. Then we can obtain as a function of E, which is 
illustrated in the figure below. The phase shift is defined modulo tt, but in the figure it is convenient not to take the 
modulo so as to have a continuous plot. At low energies the s-scattering phase shift is So{E) = —ksR and the time 
delay is r sa —2R/ve- As the energy is raised there is an extra tt shift each time that E goes through a resonance. 
In order to ensure narrow resonance one should assume that the well is shielded by a large barrier. At the center of 
a resonance the time delay is of order l/r^. 




== [44.4] Scattering resonance of a spherical shielded well 

The solution of the £ > 1 version of the shielded well scattering problem goes along the same lines as in the £ = 
case that has been discussed above. The only modification is a change of convention: we work below with the 
radial functions R{r) = u(r)/r and not with u{r). Accordingly for the logarithmic derivative on the boundary of the 
scattering region we use the notation kg instead of ki. Once we know the ki the phase shift can be calculated using 
the formula 



hf \ h{V) - (/i^ /h^ )ki 



hi 



ki{V) - ih'+/h+)kE 



(1214) 



In what follows we fix the energy E of the scattered particle, and discuss the behavior of the phase shift and the cross 
section as a function of V. In physical applications V can be interpreted as some "gate voltage". Note that in most 
textbooks it is customary to fix V and to change E. We prefer to change V because then the expansions are better 
controlled. In any case our strategy gives literally equivalent results. Following Messiah p. 391 and using the notations 



ht 



ht 



(1215) 



we write 



A2St 



MS? 



ki{V) — e + ij 
ki{V) - e-ij 



(1216) 



which gives 



arctan 



ke{V)~e 



(1217) 



We can plot the right hand side of the last equation as a function of V. If the shielding is large we get typically 
Si sa as for a hard sphere. But if V is smaller than E we can find narrow resonances as in the £ = quasi ID 
problem. The analysis of these resonances is carried out exactly in the same way. Note that for £ > wc might have 
distinct resonances even without shielding thanks to the centrifugal barrier. 



199 



QM in Practice (part III) 



[45] The Aharonov-Bohm effect 



[45.1] The Aharonov-Bohm geometry 



In the quantum theory it is natural to treat the potentials V, A as basic variables and £, B as derived variables. Below 
we will discuss the case in which £ = S = in the area where the particle is moving. According to the classical theory 
we expect that the motion of the particle will not be affected by the field, since the Lorentz force is zero. However, 
we will sec that according to the quantum theory the particle will be affected, since A^i). This is a topological effect 
that we are going to clarify. 

In what follows we discuss a ring with magnetic flux $ through it. This is called Aharonov-Bohm geometry. To have 
flux through the ring means that: 

A-dl = j jBds = <i> (1218) 

Therefore the simplest possibility is 
$ 

A = - (1219) 
L 

where L is the length of the ring. Below we will "spread out" the ring so that we get a particle in the interval 
< a; < L with periodical boundary conditions. 



The eigenstates of Ti. are the momentum states |fc„) where: 
27r 

k„ = — X integer (1221) 

The eigenvalues are: 

1 /27rft e$\^ 1 f2Tih\^ f V 



2m V i " cl) 2m \ L ) \ 2TThc ) "-^^^^^ 

The unit 2'Khc/e is called "fluxon". It is the basic unit of flux in nature. We see that the energy spectrum is influenced 
by the presence of the magnetic flux. On the other hand, if we draw a graph of the energies as a function of the flux 
we see that the energy spectrum repeats itself every time the change in the flux is a whole multiple of a fluxon. (To 
guide the eye we draw the ground state energy level with thick line). 
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<1> [fluxons] 



The fact that the electron is located in an area where there is no Lorentz force £ = B ^ 0, but is still influenced by 
the vector potential is called the Aharonov Bohm Effect. This is an example of a topological effect. 



^^^^= [45.2] The energy levels of a ring with a scatterer 

Consider an Aharonov-Bohm ring with (say) a delta scatterer: 



1 / e$' 
i/=- p--) +uSi.) 



(1223) 



We would like to find the eigenenergies of the ring. The standard approach is to write the general solution in the empty 
segment, and then to impose the matching condition over the delta scatterer. A simpler method of solution, which 
is both elegant and also more general, is based on the scattering approach. In order to characterize the scattering 
within the system, the ring is cut at some arbitrary point and the S matrix of the open segment is specified. It is 
more convenient to use the row-swapped matrix, such that the transmission amplitudes are along the diagonal: 



The periodic boundary conditions imply the equation 



(1224) 



which has a non-trivial solution if and only if 
det(S(i;) - 1) = 

Using 

det(S - /) = det(S) - trace(S) + 1 
det(S) = (e*^)2 
trace(S) = 2y/ge'-'^ cos cj) 



(1225) 



(1226) 



(1227) 
(1228) 
(1229) 
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we get the desired equation: 



cos(7(£;)) = ^/g{E) cos(0) (1230) 
where (p — e$/fi,. With a delta scatterer the transmission of the ring is 



and the associated phase shift is 

-f{E) = ksL - avciimi -^u] (1232) 

Note that the free propagation phase is included in 7. In order to find the eigen-energies we plot both sides as a function 
of E. The left hand side oscillates between —1 and +1, while the right hand side is slowly varying monotonically. 

== [45.3] Perturbation theory for a ring + scatterer 



Let us consider a particle with mass m on a ID ring. A flux <f> flows through the ring. In addition, there is a scatterer 
on the ring, described by a delta function. The Hamiltonian that describes the system is: 



If 



^ = ^\P-L^ (1233) 

This Hamiltonian's symmetry group is 0(2). This means symmetry with respect to rotations and reflections. In fact, 
in one dimension, rotations and reflections are the same (since a ID ring = a circle). Only in higher dimensions they 
arc different (torus ^ sphere). 

Degeneracies are an indication for symmetries of the Hamiltonian. If the eigenstate has a lower symmetry than the 
Hamiltonian, a degeneracy appears. Rotations and reflections do not commute, that is why we have degeneracies. 
When we add flux or a scatterer, the degeneracies open up. Adding flux breaks the reflection symmetry, and adding 
a scatterer breaks the rotation symmetry. Accordingly, depending on the perturbation, it would be wise to use one 
of the following two bases: 

The first basis: The first basis complies with the rotation (=translations) symmetry: 

|n - 0) = (1234) 



|n, anticlockwise) 

|n, clockwise) = —=e~^'''^^ 



Where fc„ = ^71 with n = 1,2,3, . . .. The degenerate states are different under reflection. Only the ground state 
|n = 0) is symmetric under both reflections and rotations, and therefore it does not have to be degenerate. It is very 
easy to calculate the perturbation matrix elements in this basis: 

(ji\8{x)\m) = J «'"(a;)(5(x)«'™(a;)dx = *"(0)1'"(0) = j (1235) 
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so we get: 



L 



/ 1 1 1 1 
1111 
1111 
1111 



(1236) 



The second basis: The second basis complies with the reflection symmetry: 
|n = 0) 

■ cos{k„x) 

■ sin(A:„x) 



\n, - 



1 

2 
I 
2" 
L 



(1237) 



The degeneracy is between the even states and the odd states that are displaced by half a wavelength with respect 
to each other. If the perturbation is not the flux but rather the scatterer, then it is better to work with the second 
basis, which complies with the potential's symmetry. The odd states are not influenced by the delta function, and 
they are also not "coupled" to the even states. The reason is that: 



{m\6{x)\n) 



*"(x)5(a;)«'"(a;)dx==0 



(1238) 



if one of the states is odd. (the sine function is not influenced by the barrier, because it is zero at the barrier). 

Consequently the subspace of odd states is not influenced by the perturbation, i.e. Vnm ~ 0, and we only need to 
diagonalizc the block that belongs to the even states. It is very easy to write the perturbation matrix for this block: 



/I V2 V2 V2 

V2 2 2 2 

V2 2 2 2 

\/2 2 2 2 

V 



(1239) 



Energy levels: Without a scatterer the eigenenergies are: 



1 /2n . $^ ^ 



Eni<i>,e^O) = _ _ X integer - - , integer = 0, ±1, ±2, ... (1240) 

zm y L L ' 

On the other hand, in the limit e cx) the system does not "feel" the flux, and the ring becomes a one-dimensional 
potential. The eigenenergies in this limit are: 

£;„($, e=(X)) = -5- (y X integer) , integer = 1, 2, 3, ... (1241) 

As one increases e the number of the energy levels does not change, they just move. See figure below. We would like 
to use perturbation theory in order to find corrections to the above expression. We consider how we do perturbation 
theory with respect to the e=0 Hamiltonian. It is also possible to carry out perturbation theory with respect to the 
e=oo Hamiltonian (for that one should use the formula for the interaction at a junction). 
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The corrections to the energy: The first-order correction to the energy of the states is: 

En=o = 41o + { (1242) 

The correction to the ground state energy, up to the second order is: 
Where we have used the identity: 

°° 1 2 

= Y (1244) 

k=l 



What happens if we choose the first basis: We will now assume that we did not notice the symmetry of the 
problem, and we chose to work with the first basis. Using perturbation theory on the ground state energy is simple 
in this basis: 

But using perturbation theory on the rest of the states is difficult because there are degeneracies. The first thing we 
must do is "degenerate perturbation theory". The diagonalization of each degenerate energy level is: 



in /2 

1 ij ^ lo 



(1246) 



Now we must move to the "new" basis, where the degeneracy is removed. This is exactly the "second" basis that we 
chose to work with because of symmetry considerations. The moral is: understanding the symmetries in the system 
can save us work in the calculations of perturbation theory. 
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== [45.4] The Aharonov-Bohm effect in a closed geometry 

The eigen-energies of a particle in a closed ring are periodic functions of the flux. In particular in the absence of 
scattering 

En = Tr-[^] [n-^\ =::mvl (1247) 



2m\Lj V 2TTnc J 2 
That is in contrast with classical mechanics, where the energy can have any positive value: 

^classical = (1248) 

According to classical mechanics the lowest state of a magnetic field is energy zero with velocity zero. This is not true 
in quantum mechanics. In other words: when we add magnetic flux, we can observe its effect on the system. The 
effect can be described in one of the following ways: 

• The spectrum of the system changes (it can be measured using spectroscopy) 

• For flux that is not an integer or half integer number there are persistent currents in the system. 

• The system has either a diamagnetic or a paramagnetic response (according to the occupancy). 

We already have discussed the spectrum of the system. So the next thing is to derive an expression for the current in 
the ring. The current operator is defined as (see digression): 



dn _ e 



yV (1249) 



It follows that The current which is created by an electron that occupies the nth level is: 



(1250) 



By looking at the plot of the energies i?„($) as a function of the flux, we can determine (according to the slope) the 
current that flows in each occupied energy level. If the flux is neither integer nor half integer, all the states "carry 
current" so that in equilibrium the net current is not zero. This phenomenon is called "persistent currents". The 
equilibrium current in such case cannot relax to zero, even if the temperature of the system is zero. 

There is a statement in classical statistical mechanics that the equilibrium state of a system is not affected by magnetic 
fields. The magnetic response of any system is a quantum mechanical effect that has to do with the quantization of 
the energy levels (Landau magnetism) or with the spins (Pauly magnetism). Definitions: 

• Diamagnetic System - in a magnetic field, the system's energy increases. 

• Paramagnetic System - in a magnetic field, the system's energy decreases. 

The Aharonov Bohm geometry provides the simplest example for magnetic response. If we put one electron in a 
ring, then when we increase the magnetic flux slightly, the system energy increases. We say that the response is 
"diamagnetic". The electron cannot "get rid" of its kinetic energy because of the quantization of the momentum. 



== [45.5] Dirac Monopoles 

Yet another consequence of the " Aharonov Bohm" effect is the quantization of the magnetic charge. Dirac has claimed 
that if magnetic monopoles exist, then there must be an elementary magnetic charge. The formal argument can be 
phrased as follows: If a magnetic monopole exists, it creates a vector potential field in space {A{x)). The effect of 
the field of the monopole on an electron close by is given by the line integral / A ■ dr. We can evaluate the integral 
by calculating the magnetic flux $ through a Stokes surface. The result cannot depend on the choice of the surface, 
otherwise the phase will not be well defined. Therefore the phase (j) = e^/hc must be zero modulo 27r. Specifically, we 
may choose one Stokes surface that passes over the monopole, and one Stokes surface that passes under the monopole. 
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and conclude that the net flux must be an integer multiple of 27r?ic/e. By using "Gauss law" we conclude that the 
monopole must have a magnetic charge that is an integer multiple of hc/2e. 

Dirac's original reasoning was somewhat more constructive. Let us assume that a magnetic monopole exist. The 
magnetic field that would be created by this monopole would be like that of a tip of a solenoid. But we have to 
exclude the region in space where we have the magnetic flux that goes through the solenoid. If we want this "flux 
line" to be unobservable then it should be quantized in units of 2'Khc/e. This shows that Dirac "heard" about the 
Aharonov Bohm effect, but more importantly this implies that the "tip" would have a charge which equals an integer 
multiple of hc/2e. 



== [45.6] The Aharonov-Bohm effect: path integral formulation 

We can also illustrate the Aharonov-Bohm Effect in an open geometry. In an open geometry the energy is not 
quantized, but rather it is determined ahead of time. We are looking for stationary states that solve the Schrodinger 
equation for a given energy. These states are called "scattering states". Below we will discuss the Aharonov-Bohm 
effect in a "two slit" geometry and then in a "ring" geometry with leads. 

First we notice the following rule: if we have a planar wave = e*'^'^, then if the amplitude at the point x = xi is 
ip{xi) = C, then at another point x = X2 the amplitude is ipi^^) = Ce''^^^^"^^^. 

Now we will generalize this rule for the case in which there is a vector potential A. For simplicity's sake, we will 
assume that the motion is in one-dimension. The eigenstates of the Hamiltonian are the momentum states. If the 
energy of the particle is E then the wavefunctions that solve the Schrodinger's equation are ip(x) = e**^*'^. Where: 



k± = ±V2mE + A EE ±kE + A (1251) 

Below we will treat the advancing wave: if at point x = xi the amplitude is ^'(^^i) = A, then at another point x = X2 
the amplitude is ip{x2) = Ae'*'^^^^~^^-'+'^ '^^~^^-'. It is possible to generalize the idea to three dimensions: if a wave 
advances along a certain path from point xi to point X2 then the accumulated phase is: 

(j) = fcs X distance -I- / A ■ dx (1252) 
If there are two different paths that connect the points xi and X2, then the phase difference is: 

A(j) ^ ks ■ Adistance +/ A-dx- A-dx^ (1253) 

Jcurvc2 Jcurvcl 

= kE ■ /distance + (j) A ■ dx = 

= fc_B • Zklistance + — $ 
he 

Where in the last term we "bring back" the standard physical units. The approach which was presented above for 
calculating the probability of the particle to go from one point to another is called " path integrals" . This approach was 
developed by Feynman, and it leads to what is called "path integral formalism" - an optional approach to calculations in 
Quantum Mechanics. The conventional method is to solve the Schrodinger's equation with the appropriate boundary 
conditions. 



== [45.7] The Aharonov-Bohm effect in a two slits geometry 

We can use the path integral point of view in order to analyze the interference in the two slit experiment. A particle 
that passes through two slits, splits into two partial waves that unite at the detector. Each of these partial waves 
passes a different optical path. Hence the probability of reaching the detector, and consequently the measures intensity 
of the beam is 



intensity = |1 x e'''^! + 1 x e''"'^p cx 1 + cos(fc(r2 - n)) 



(1254) 
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If we mark the phase difference as (f) = k{r2 — fi), then we can write the result as: 



intensity oc 1 + cos </> (1255) 

Changing the location of the detector causes a change in the phase difference (/>. The " intensity" , or more precisely 
the probability that the particle will reach the detector, as a function of the phase difference 4> makes an "interference 
pattern" . If we place a solenoid between the slits, then the formula for the phase difference will be: 

= A:(r2 -ri) + ^$ (1256) 
Tic 

If we draw a graph of the "intensity" as a function of the flux we will get the same graph as we would get if we 
changed the location of the detector. In other words, we will get an "interference pattern" as a function of the flux. 



Slits 



Source 



Screen 



Detector 



If we want to find the transmission of an Aharonov-Bohm device (a ring with two leads) then we must sum all the 
paths going from the first lead to the second lead. If we only take into account the two shortest paths (the particle 
can pass through one arm or the other arm), then we get a result that is formally identical to the result for the two 
slit geometry. In reality we must take into account all the possible paths. That is a very long calculation, leading to 
a Fabry- Perrot type result (see previous lecture). In the latter case the transmission is an even function of $, even if 
the arms do not have the same length. Having the same transmission for ±$ in the case of a closed device is implied 
by time reversal symmetry. 
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[46] Motion in uniform magnetic field (Landau, Hall) 
== [46.1] The two-dimensional ring geometry 



Let us consider a three-dimensional box with periodic boundary conditions in the x direction, and zero boundary 
conditions on the other sides. In the absence of magnetic field we assume that the Hamiltonian is: 



zm 



Zm " 



(1257) 



The eigenstates of a particle in such box are labeled as 



\k^,ny,n,) (1258) 

27r . 

^ — X mteger 

ny,nz = 1,2,3... 
The eigenenergies are: 

^^^— = I; +^- + 2^(1:"^)' (1259) 

We assume to be very small compared to the other dimensions. We will discuss what happens when the system is 
prepared in low energies such that only riz ~ \ states are relevant. So we can ignore the z axis. 

== [46.2] Classical Motion in a uniform magnetic field 



Below we will discuss the motion of electrons on a two-dimensional ring. We assume that the vertical dimension 
is "narrow", so that we can safely ignore it, as was explained in the previous section. For convenience's sake we 
will "spread out" the ring so that it forms a rectangle with periodical boundary conditions on Q < x < L.^, and an 
arbitrary potential V{y) on axis y. In addition, we will assume that there is a uniform magnetic field B along the z 
axis. Therefore, the electrons will be affected by a Lorentz force F = —eB x v. If there is no electrical potential, the 
electrons will perform a circular motion with the cyclotron frequency: 

eB 

= — (1260) 
mc 

If the particle has a kinetic energy E, then its velocity is: 

VE = (1261) 
V m 

And then it will move in a circle of radius: 

rE = ^ = ^v, (1262) 
ujb eB 

If we take into account a non-zero electric field 



(1263) 
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we get a motion along a cycloid with the drift velocity (see derivation below): 



"drift 



(1264) 



Let us remind ourselves why the motion is along a cycloid. The Lorentz force in the laboratory reference frame is 
(from now on we will "swallow" the charge of the electron in the definition of the field): 



F=£-Bxv 



(1265) 



If we move to a reference frame that is moving at a velocity vq we get: 



F^£ -Bx {i/ + va) = {£ + vo X B) - B X v' 



(1266) 



Therefore, the non-relativistic transformation of the electromagnetic field is: 



£' = £ + voX B 
B' = B 



(1267) 



If we have a field in the direction of the y axis in the laboratory reference frame, we can move to a "new" reference 
frame where the field will be zero. From the transformation above we conclude that in order to zero the electrical 
field, the velocity of the "new" system must be: 



v, = c- 



(1268) 



In the "new" reference frame the particle moves in a circle. Therefore, in the laboratory reference frame it moves 
along a cycloid. 




(X ) 




Conventionally the classical state of the particle is described by the coordinates r = (x, y) and v = {vx^ Vy). But from 
the above discussion it follows that a simpler description of the trajectory is obtained if we follow the motion of the 
moving circle. The center of the circle R = {X,Y) is moving along a straight line, and its velocity is v^^.-^^^. The 
transformation that relates R to r and v is 



R 



1 ^ 

Bz X V 

LOB 



(1269) 



where Cz is a unit vector in the z direction. The second term in the right hand side is a vector of length te in the 
radial direction (perpendicular to the velocity). Thus instead of describing the motion with the canonical coordinates 
(x, y, Vx, Vy), we can use the new coordinated {X, Y, Vx, Vy). 
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[46.3] The Hall Effect 



If we have particles spread out in a uniform density p per unit area, then the current density per unit length is: 

where V is the electrical potential (measured in Volts) and hence the extra e. The total current is: 

J.dy = -p—{V{v2) - V{y,)) = -p-(/.2 - Mi) (1271) 
So that the Hall conductance is: 

%all = (1272) 

Classically we have seen that (x (dV/dy). We would like to derive this result in a quantum mechanical way and to 
find the quantum Hall conductance. In the quantum analysis we shall see that the electrons occupy "Landau levels". 
The density of electrons in each Landau Level is B/2Trhc. From this it follows that the Hall conductance is quantized 
in units of e^/27rft, which is the universal unit of conductance in quantum mechanics. 

We note that both Ohm's law and Hall's law should be written as: 

/ = Gx i(^2-/^i) (1273) 
and not as: 

I = Gx{V2-Vi) (1274) 

Where n is the electrochemical potential. When the electrical force is the only cause for current, then the elec- 
trochemical potential is simply the electrical potential (multiplied by the charge of the electron). At zero absolute 
temperature fj, can be identified with the Fermi energy. In metals in equilibrium, according to classical mechanics, 
there are no currents inside the metal. This means that the electrochemical potential must be uniform. This does 
not mean that the electrical potential is uniform! For example: when there is a difference in concentrations of the 
electrons (e.g. different metals) then there should be a "contact potential" to balance the concentration gradient, 
so as to have a uniform electrochemical potential. Another example: in order to balance the gravitation force in 
equilibrium, there must be an electrical force such that the total potential is uniform. In general, the electrical field 
in a metal in equilibrium cannot be zero! 



== [46.4] Electron in Hall geometry: Landau levels 

In this section we will see that there is an elegant and formal way of treating the problem of an electron in Hall 
geometry within the framework of the Hamiltonian formalism. This method of solution is valid both in classical 
mechanics and in quantum mechanics (all one has to do is to replace the Poisson brackets with commutators) . In the 
next lecture we will solve the quantum problem again, using the conventional method of "separation of variables". 
Below we use the Landau gauge: 



A = {-By, 0,0) 

B = V X 1= (0,0,6) 



(1275) 
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Therefore, the Hamiltonian is (from here we will "swallow" the charge of the electron in the definition of the field): 

n=:^{p,+Byf + :^{pyf + V{y) (1276) 
zm zm 

We define a new set of operators: 

v, = -{p, + By) (1277) 
m 

1 

= -Py 

X = X + —Vy = X + l-Py 

LOB B " 

Y = y vx = --f^Px 

uJB B 

We notice that from a geometrical perspective, X,Y represent the "center of the circle" that the particle moves in. 
We also notice the commutation relations: the operators X,Y commute with Vx,Vy. On the other hand: 

=-*( — )-! (1278) 
K,vy]=i— — 

So that we can define a new set of canonical coordinates: 

(1279) 





mc 


Qi 


eB 


Pi 


= mvy 


Q2 


= Y 






Pi 






c 



And rewrite the Hamiltonian as: 

W(Oi, Pi, O2, P2) = ^P? + IrmulQl + V{Qi + Q2) (1280) 
zm z 

We see (as expected) that Q2 = is a constant of motion. We also see that the kinetic energy is quantized in units 
of lub- Therefore, it is natural to mark the eigenstates as \Y, v), where: = 0, 1, 2, 3 . . ., is the kinetic energy index. 



== [46.5] Electron in Hall geometry: The Landau states 

The Hamiltonian that describes the motion of a particle in the Hall bar geometry is: 

n = ^iPx+Byr + :^{Py)^ + Viy) (1281) 
zm zm 

Recall that we have periodic boundary conditions in the x axis, and an arbitrary confining potential V{y) in the y 
direction. We also incorporate a homogeneous magnetic field B ~ (0, 0,B). The vector potential in the Landau gauge 
is A = {—By, 0, 0). We would like to find the eigenstates and the eigenvalues of the Hamiltonian. 

The key observation is that in the Landau gauge the momentum operator p^ is a constant of motion. It is more 
physical to re-phrase this statement in a gauge independent way. Namely, the constant of motion is in fact 
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which represents the y location of the classical cycloid. In fact the eigenstates that we are going to find are the 
quantum mechanical analog of the classical cycloids. The eigenvalues affix are iir/ x integer, Equivalcntly, we may 
write that the eigenvalues of Y are: 

2-ir 

Yi = —i, integer] (1283) 

That means that the y distance between the eigenstates is quantized. According to the "separation of variables 
theorem" the Hamiltonian matrix is a block diagonal matrix in the basis in which the Y matrix is diagonal. It is 
natural to choose the basis \i, y) which is determined by the operators Px, y- 

{lMW',y') = Si,i'Hly, (1284) 
It is convenient to write the Hamiltonian of the block £ in abstract notation (without indexes): 



Or, in another notation: 

n' = + V,{y) (1286) 

zm 

where the effective potential is: 

Vi^y) = y{y) + \^^l{y - Y,f (i287) 

For a given ^, we find the eigenvalues \£tv) of the one-dimensional Hamiltonian 7i^. The running index is 
1^ = 0,1,2,3,.... 

For a constant electric field we notice that this is the Schrodinger equation of a displaced harmonic oscillator. More 
generally, the harmonic approximation for the effective potential is valid if the potential V{y) is wide compared to 
the quadratic potential which is contributed by the magnetic field. In other words, we assume that the magnetic field 
is strong. We write the wave functions as: 

^ ^e-'(«^^)^v'^^)(y-y,) (1288) 



We notice that BYi are the eigenvalues of the momentum operator. If there is no electrical field then the harmonic 
approximation is exact, and then ip^''''^ are the eigenfunctions of a harmonic oscillator. In the general case, we must 
"correct" them (in case of a constant electric field they are simply shifted). If we use the harmonic approximation 
then the energies are: 

Et,, « V{Yt) + {^^y)ujB (1289) 
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Plotting Eg I, against Yg we get a picture of "energy levels" that are called "Landau levels" (or more precisely they 
should be called "energy bands"). The first Landau level is the collection of states with i/ = 0. We notice that the 
physical significance of the term with i' is the kinetic energy. The Landau levels are "parallel" to the bottom of 
the potential V{y). If there is an area of width Ly where the electric potential is flat (no electric field), then the 
eigenstates in that area (for a given v) will be degenerate in the energy (they will have the same energy). Because of 
the quantization of Yi the number of particles that can occupy a band of width Ly in each Landau level is finite: 

giandau = ^^/^ = (1290) 



[46.6] Hall geometry with AB flux 



Here wc discuss a trivial generalization of the above solution which will help us in the next section. Let us assume 
that wc add a magnetic flux $ through the ring, as in the case of Aharonov-Bohm geometry. In this case, the vector 
potential is: 



A= ( -i--6y,0,0 



(1291) 



We can separate the variables in the same way, and get: 
Ei,^ « V (Ye + -J—<i>] + l + iy UJB 



(1292) 
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== [46.7] The Quantum Hall current 

We would like to calculate the current for an electron that occupies a Landau state \£, v): 

JAx,y) = ^{evj{x-x)6{y^y) + h.c.) (1293) 

If- e 
Jx{y) = -;— Jxix,y)dx ^ -j—VxSiy-y) 

J ^x 

= = Tx"^ = lMy)dy 

Recall that Vx = {eB/m){y — Y), and that the Landau states are eigenstates of Y. Therefore, the current density of 
an occupied state is given by: 

rj{y) = {MUy)m -P-{{y- y) v)) = ^iv ~ y^) W'iy - y,)\' (1294) 

If we are in the region (Y^ < y) we observe current that flows to the right (in the direction of the positive x axis), and 
the opposite if we are on the other side. This is consistent with the classical picture of a particle moving clockwise in 
a circle. If there is no electric field, the wave function is symmetric around Yf , so wc get zero net current. If there 
is a non-zero electric field, it shifts the wave function and then we get a net current that is not zero. The current is 
given by: 



ri. I ri^. ...... dE„. _ 1 '^^(y) 



/r = J^-{y)dy 



9$ BLx dy 
For a Fermi occupation of the Landau level we get 



(1295) 

y=Yt 



V /^^ - r f 1 ^^(2^) ^ ri296) 



~^{V{y2)-V{y,)) = -^i^l2-^^l) 



In the last equation we have restored the standard physical units. We see that if there is a chemical potential difference 
we get a current Ix- The Hall coefficient is e/27r/i times the number of full Landau levels. In other words: the Hall 
coefficient is quantized in units of e/2TTh. 
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[47] Motion in a central potential 



[47.1] The Hamiltonian 



Consider the motion of a particle under the influence of a sphericaUy symmetric potential that depends only on the 
distance from the origin. We can write the Hamiltonian in spherical coordinates. 

n = ^p^ + eV{r) ^ ^{pl + }-L^)^eV{r) (1297) 

where 



L = rxp (1298) 
Pr^-l^r (1299) 



The Hamiltonian commutes with rotations: 

[n,R]^ (1300) 
And in particular: 

['H,L^]=0 (1301) 

[n. L,] = 

According to the separation of variables theorem the Hamiltonian becomes block diagonal in the basis which 
is determined by and L^. The states that have definite £ and m quantum numbers are of the form 
ip{x, y, z) = R{r)Y^^^ {9 , f), so there is some freedom in the choice of this basis. The natural choice is \r, i, m) 

(r,0,v5|ro,4,mo) ->^'°'"°(e,¥')-'5(r-ro) (1302) 

r 

These are states that "live" on spherical shells. Any wavcfunction can be written as a linear combination of the states 
of this basis. Note that the normalization is correct (the volume element in spherical coordinates includes r^). The 
Hamiltonian becomes 

(r, £, m\n\r', £' , m') = Si^tSm^rn-ni'^'T^ (1303) 

2m \ 2mr-^ / 2m 

Where p s- -~i{d/dr). The wavcfunction in the basis which has been defined above are written as 

1^) = 5^ 7/,™(r) |r,£,m) ^ ^ !f^!^y^^™(0, ^) (1304) 

In the derivation above we have made a " shortcut" . In the approach which is popular in textbooks the basis is not 
properly normalized, and the wave function is written as V'('', S, ip) = ip{x, y, z) — R{r)Y^'™'{9, ip), without taking the 
correct normalization measure into account. Only in a later stage they define u(r) = rR{r). Obviously eventually 
they get the same result. By using the right normalization of the basis we have saved an algebraic stage. 

By separation of variables the Hamiltonian has been reduced to a semi-one-dimensional Schrodinger operator acting 
on the wave function u{r). By " semi-onc-dimensional" we mean that < r < oo. In order to get a wave function 
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tjj{x,y,z) that is continuous at the origin, wc must require the radial function R{r) to be finite, or alternatively the 
function u{r) has to be zero at r = 0. 



^^^^= [47.2] Eigenstates of a particle on a spherical surface 

The simplest central potential that we can consider is such that confine the particle to move within a spherical shell 
of radius R. Such potential can be modeled as V{r) = ~X6{r — R). For £ = we know that a narrow deep well has 
only one bound state. We fix the energy of this state as the reference. The centrifugal potential for £ > simply lifts 
the potential floor upwards. Hence the eigen-energies are 

Elm = + (1305) 

We remind ourselves of the considerations leading to the degeneracies. The ground state F^", has the same symmetry 
as that of the Hamiltonian: both invariant under rotations. This state has no degeneracy. On the other hand, the 
state Y^'^ has a lower symmetry and by rotating it we get 3 orthogonal states with the same energy. The degeneracy 
is a "compensation" for the low symmetry of the eigenstates: the symmetry of the energy level as a whole (i.e. as a 
subspace) is maintained. 

We mark the number of states up to energy E by N{E). It is easy to prove that the density of states is: 
dN m 



dE 2T:h? 



A (1306) 



Where A is the surface area. It can be proved that this formula is valid also for other surfaces. The most trivial 
example is obviously a rectangular surface for which 

E..m = + (1307) 

The difference between the E^m spectrum of a particle on a sphere and the En^m spectrum of a particle on a rectangular 
is in the way that the eigenvalues are spaced, not in their average density. The degeneracies of the spectrum are 
determined by the symmetry group of the surface on which the motion is bounded. If this surface has no special 
symmetries the spectrum is expected to be lacking systematic degeneracies. 



== [47.3] The Hydrogen Atom 

The effective potential V^{r) that appears in the semi-one-dimensional problem includes the original potential plus 
a centrifugal potential (for t ^ 0). Typically, the centrifugal potential -|-l/r^ causes a "potential barrier" (positive). 
But in the case of the Hydrogen atom the attractive potential — l/r wins and there is no such barrier. Moreover, 
unlike typical short range potentials, the potential has an infinite number of bound states in the E < Q range. In 
addition, there are also "resonance" states in the range i? > 0, that can "leak" out through the centrifugal barrier (by 
tunneling) into the continuum of states outside. Another special property of the Hydrogen atom is the high degree 
of symmetry (the Hamiltonian commutes with the Runge-Lentz operators). This is manifested in the degeneracy of 
energy levels that have different I. 

For sake of later reference we write the potential as: 

V{r) = -- (1308) 
r 

Solving the radial equation (for each £ separately) one gets: 
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Where i/ = 1, 2, 3, . . .. The energy levels are illustrated in the diagram below: 




-0.8 



12 3 4 

I 



In the rest of this section we remind the reader why there are degeneracies in the energy spectrum (this issue has been 
discussed in general in the section about symmetries and their implications). If the Hamiltonian has a single constant 
of motion, there will usually not be a degeneracy. According to the separation of variables theorem it is possible to 
change to a basis in which the Hamiltonian will have a block structure, and then diagonalize each block separately. 
There is no reason for a conspiracy amongst the blocks. Therefore there is no reason for a degeneracy. If we still get 
a degeneracy it is called an accidental degeneracy. 

But, if the Hamiltonian commutes with a "non-commutative group" then there are necessarily degeneracies that are 
determined by the dimensions of the irreducible representations. In the case of a central potential, the symmetry 
group is the "rotation group". In the special case of the potential — 1/r a larger symmetry group exists. 

A degeneracy is a compensation for having eigenstates with lower symmetry compared with the Hamiltonian. The 
degree of degeneracy is determined by the dimensions of the irreducible representations. These statements were 
discussed in the concluding Fundamentals II section. Let us paraphrase the argument in the present context: Assume 
that we have found an eigenstate of the Hamiltonian. If it has full spherical symmetry then there is no reason for 
degeneracy. These are the states with £ = 0. But, if we have found a state with a lower symmetry (the states with 
f 7^ 0) then we can rotate it and get another state with the same energy level. Therefore, in this case there must be 
a degeneracy. 

Instead of "rotating" an eigenstate, it is simpler (technically) to find other states with the same energy by using ladder 
operators. This already gives an explanation why the degree of the degeneracy is determined by the dimension of 
the irreducible representations. Let us paraphrase the standard argument for this statements in the present context: 
The Hamiltonian H. commutes with all the rotations, and therefore it also commutes with all their generators and 
also with L^. We choose a basis in which both TL and are diagonal. The index of the energy levels n is 

determined by 7i, while the index I is determined by L/^ . The index ^ differentiates states with the same energy and 
the same i. According to the "separation of variables theorem" every rotation matrix will have a "block structure" in 
this basis: each level that is determined by the quantum numbers (n, €) is an invariant subspace under rotations. In 
other words, Ti together with induce a decomposition of the group representation. Now we can apply the standard 
procedure in order to conclude that the dimension of the (sub) representation which is characterized by the quantum 
number I, is 2£+l. In other words, we have "discovered" that the degree of the degeneracy must be 2^ -f 1 (or a 
multiple of this number). 



[48] The Hamiltonian of a spin 1/2 particle 
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== [48.1] The Hamiltonian of a spinless particle 

The Hamiltonian of a spinless particle can be written as: 

^ 2 2 

n = —{p- eA{r)f + eVir) = ^ - ^{A-p + p- A) + ^A^ + eV{r) (1310) 
2m 2m 2m 2m 

We assume that the field is uniform B ~ (0, 0, So). In the previous lectures we saw that this field can be derived from 
A = (—Soy, 0,0), but this time we use a different gauge, called "symmetrical gauge": 

A= (^-^Boy,lBox,Q^ =^Bxr (1311) 
We will also use the identity (note that triple multiplication is associative): 

A-p=^{Bxr)-p= ^B-{rxp) = ^B-L (1312) 
Substitution into the Hamiltonian gives: 

^ = 1^ + - • ^ + - (1313) 

2m 2m om 

Specifically for a homogeneous field in the z axis we get 

n = ^ + eVir) - ^BoL^ + ^B^x' + y') (1314) 
2m 2m om 

The two last terms are called the " Zeeman term" and the " diamagnetic term" . 

^Zeeman,orbital motion ^ ^2m^ ^ (1315) 



^^^^= [48.2] The additional Zeeman term for the spin 

Spectroscopic measurements on atoms have shown, in the presence of a magnetic field, a double (even) Zeeman 
splitting of the levels, and not just the expected "orbital" splitting (which is always odd). From this Zeeman has 
concluded that the electron has another degree of freedom which is called "spin 1/2". The Hamiltonian should include 
an additional term: 

'^Zeeman,spin = ^^^^ ' ^ (1316) 

where Cq = \e\ is the elementary unit charge. The spectroscopic measurements of the splitting make it possible to 
determine the gyromagnetic coefficient to a high precision. The same measurements were conducted also for protons, 
neutrons (a neutral particle!) and other particles: 

Electron: = -2.0023 
Proton: gp = +5.5854 
Neutron: g„ = -3.8271 
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The implication of the Zecman term in the Hamiltonian is that the wavefunction of the electron precesses with the 
frequency 



2m 



(1317) 



while the spin of the electron precesses with a twice larger frequency 



2m 



g = \ge\ ~ 2 



(1318) 



[48.3] The spin orbit term 



The added Zeeman term describes the interaction of the spin with the magnetic field. In fact, the "spin" degree of 
freedom (and the existence of anti-particles) is inevitable because of relativistic considerations of invariance under 
the Lorentz transformation. These considerations lead to Dirac's Hamiltonian. There are further "corrections" to the 
non-relativistic Hamiltonian that are needed in order to make it "closer" to Dirac's Hamiltonian. The most important 
of these corrections is the " spin orbit interaction" : 

Ti spin- orbit = ^^^(^ XP) ■ S (1319) 

where we implicitly use g ~ 2. In other words, the spin interacts with the electric field. This interaction depends on 
its velocity. This is why the interaction is called spin-orbit interaction. If there is also a magnetic field then we have 
the additional interaction which is described by the Zeeman term. 

We can interpret the "spin-orbit" interaction in the following way: even if there is no magnetic field in the "laboratory" 
reference frame, still there is a magnetic field in the reference frame of the particle, which is a moving reference frame. 
This follows from Lorentz transformation: 

6 = S - Vfj.ame ^ ^ (1320) 

It looks by this argument that there is a missing factor g sa 2 in the spin-orbit term. But it turns out that this factor 
is canceled by another factor of 1/2 that comes from the so called "Thomas precession". 

We summarize this section by writing the common non-relativistic approximation to the Hamiltonian of a particle 
with spin 1/2. 

■H = -^(p - el(r))2 + eVir) ~ g^B ■ S~ -^(£ x p) ■ S (1321) 
zm zm zm 

In the case of spherically symmetric potential V{r) the electric field is 
V'ir) 

£ = --^f (1322) 
r 

Consequently the Hamiltonian takes the form 

Ti = —(pl + + eVir) + DiamagnetieTerm - —B ■ L - g—B ■ S + -^-^^^i • S (1323) 

2m V r"' / 2m ' 2m 2m^ r 



[48.4] The Dirac Hamiltonian 



In the absence of an external electromagnetic field the Hamiltonian of a free particle should be a function of the 
momentum operator alone Ti = h{p) where p = (i, y, z). Thus p is a good quantum number. The reduced Hamiltonian 
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within a,p subspace is Ti^^^ = h{p). If the particle is spineless h{p) is a number and the dispersion relation is e = h{p). 
But if the particle has an inner degree of freedom (spin) then h{p) is a matrix. In the case of Pauli Hamiltonian 
h{p) = [p^ / (2m))l is a 2 X 2 matrix. We could imagine a more complicated possibility of the type h{p) = a ■ p + .... 
In such case p is a good quantum number, but the spin degree of freedom is no longer degenerated: Given p, the 
spin should be polarized either in the direction of the motion (right handed polarization) or in the opposite direction 
(left handed polarization). This quantum number is also known as helicity. The helicity might be a good quantum 
number, but it is a "Lorentz invariant" feature only for a massless particle (like a photon) that travels in the speed 
of light, else one can always transform to a reference frame where p = and the helicity become ill defined. 

Dirac has speculated that in order to have a Lorentz invariant Schrodinger equation {dip/dt — ■■.) for the evolution, 
the matrix h{p) has to be linear (rather than quadratic) in p. Namely h{p) = a ■ p + const/3. The dispersion relation 
should be consistent with ~ + p^ which implies h{p)^ = (m^ +p^)l. It turns out that the only way to satisfy 
the latter requirement is to assume that a and /3 are 4x4 matrices: 







1 
1 



(1324) 



Hence the Dirac Hamiltonian is 



n 



a ■ p + (3m 



(1325) 



It turns out that the Dirac equation, which is the Schrodinger equation with Dirac's Hamiltonian, is indeed invariant 
under Lorentz. Given p there are 4 distinct eigenstates which we label as |p. A). The 4 eigenstates are determined via 
the diagonalization of h{p). Two of them have the dispersion e = + and the other two have the dispersion 

e = — \/ p^ + m^. It also turns out that the helicity (in a give reference frame) is a good quantum number. The helicity 
operator is E • p where 



cr, 



(1326) 



This operator commutes with Dirac Hamiltonian. Thus the electron can be right or left handed with either positive 
or negative mass. Dirac's interpretation for this result was that the "vacuum" state of the universe is like that of an 
intrinsic semiconductor with gap 2mc^. Namely, instead of talking about electrons with negative mass we can talk 
about holes (positrons) with positive mass. The transition of an electron from an occupied negative energy state to 
an empty positive energy state is re-interpreted as the creation of an electron positron pair. The reasoning of Dirac 
has lead to the conclusion that particles like the electron must have a spin as well as antiparticle states. 
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[49] Implications of having "spin" 



[49.1] The Stern- Gerlach effect 



We first discuss tlie effect tliat tlie Zeeman term lias on tlie dynamics of a "free" particle. We will see that because of 
this term, there is a force on the particle if the magnetic field is not homogeneous. For simplicity's sake, we will assume 
that there is a magnetic field in the direction of the Z axis (note that this can be true only as an approximation). If 
we keep only the Zeeman term then the Hamiltonian is: 

n^^-g^B.{x)S. (1327) 
zm zm 

We see that Sz is a constant of motion. That means that if we have prepared a particle in the "up" state, it sees an 
effective potential: 

^efr = -^52^^.W (1328) 

While a particle with spin "down" sees an inverted potential (with the opposite sign). That means that the direction 
of the force depends on the direction of the spin. We can come to the same conclusion by looking at the equations of 
motion. As we have seen before: 

-^(x) = (*[7i,x]) = (i(p-A(x))) (1329) 
dt m 

This still holds with no change. But what about the acceleration? We see that there is a new term: 

-f-(w) = = - <f Lorcntz force + .g-^(VS3)S'^ \ (1330) 

dt m \ 2m / 

The observation that in inhomogeneous magnetic field the force on the particle depends on the spin orientation is 
used in order to measure the spin using a Stern-Gcrlach apparatus. 

== [49.2] The reduced Hamiltonian in a central potential 

The reduced Hamiltonian for with the I sub-space of a particle moving in a central potential can be written as: 

^(^) ^ _ e _ g-^BS. + /(r)L • S (1331) 
zm zm 

The interaction L ■ S commutes with L^, therefore i is still a good quantum number. If we assume that the last terms 
in the Hamiltonian are a weak perturbation that does not "mix" energy levels, then we can make an approximation(!) 
and reduce further by taking the states that have the same energy: 

H'^'') = -hL, - ghSz + VL-S + const (1332) 

Where the first term with h = eB/{2m) is responsible for the orbital Zeeman splitting, and the second term with gh 
is responsible to the spin-related Zeeman splitting. We also use the notation 

v={£,iy\fir)\£,iy) (1333) 



If the spin-orbit interaction did not exist, the dynamics of the spin would become independent of the dynamics of the 
wave function. But even when there is a spin-orbit interaction, the situation is not so bad. L ■ S couples only states 
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with the same i. From now we focus on the second energy level of the Hydrogen atom. The Hamiltonian matrix is 
8x8. But it decomposes trivially into matrix blocks of 2 x 2 and 6x6. We notice that the 2x2 block is already 
diagonalizcd: the only term that could destroy the diagonalization within this block is L • S*, but it is zero in this 
block since £ = 0. So, we only have to diagonalize the 6x6 block. 

Why does L ■ S couple only states with the same The full rotation generator J = L + S rotates both the wave 
equation and the spin. The system is symmetrical to rotations. So, it is obvious that [H, J] ~ and specifically 
[L ■ S, J] = 0. On the other hand, the symmetry: [H, L"^] = [L ■ S, L'^] = is less trivial. The Hamiltonian is not 
symmetrical under rotations generated by L. We must rotate both the wave function and the spin together, in order 
for the Hamiltonian to stay the same. In order to prove the commutation relation with L^, we notice that: 

L • 5 = i ( J2 - - 5-2) (1334) 



We consider the reduced Hamiltonian for the state £ ~ I, v = I, in the standard basis \£ = 1,1^ = l,mi,ms). As 
mentioned above, the Hamiltonian is represented by a 6 x 6 matrix. It is easy to write the matrices for the Zeeman 
terms: 
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(1335) 



o\ 






-1/ 



But the spin-orbit term is not diagonal. In principle, we must make the following calculation: 



L-S^I{J'-L'-S')^1[J! 



Jf. 



(1336) 



Where the most simple term in this expression includes the diagonal matrix: 
Jz = L2 + Sz — *■ [6x6 matrix] 



(1337) 



And there are two additional terms that include 6x6 matrices that are not diagonal . . . We will see later on that 
there is a relatively simple way to find the representation of in the "normal" basis. In fact, it is very easy to 
"predict" what it will look like after diagonalization: 



J' 



/(15/4) \ 

(15/4) 

(15/4) 

(15/4) 





V 



(3/4) 
(3/4)/ 



(1338) 



It follows that the eigenenergies of the Hamiltonian in the absence of a magnetic field are: 



Ej^3 = u/2 degeneracy = 4 
E;i = —V degeneracy = 2 



(1339) 



On the other hand, in a strong magnetic field the spin-orbit term is neghgible. and we get: 

Eme,m, = - {mi + gms)h 
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== [49.3] Detailed analysis of the Zeeman effect 

The reduced Hamiltonian that describes (for example) the first £ ~ 1 level of the Hydrogen atom is the 6x6 matrix 



n = hL^+ghS^+vL- S 



(1341) 



where h = is the magnetic field in appropriate units, and v is determined by the spin orbit interaction. Also we 
have g = 2.0023. Using the identity 



s') ^ - r - 2 - 



(1342) 



the Hamiltonain can be written in the alternate form: 



H = hL^+ghS^ 



.I'- 



ll 



(1343) 



Using the results of previous sections we see that the Hamiltonian matrix in the \mf , m^) basis is: 



/ 1 \ 

1 





0-1 

Voooo -1 J 



gh 



1"^ 














\ 





1 

2 




















1 

2 




















1 

2 




















1 

2 





Vo 














-hi 



l\ 





V 




1 

V2 















1 










1 

2 








\) 



(1344) 



while in the |j, rrij) basis we have 
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The spectrum of Ti. can be found for a range of h values. See again the Mathematica file zeeman. nb. The results (in 
units such that v — 1) are illustrated in the following figure: 
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If the field is zero the Hamiltonian is diagonal in the \ j, rrij) basis and we find that 



V 

Ej^3 ~ — degeneracy = 4 



J 2 



-V degeneracy = 2 



(1345) 
(1346) 



In a very strong magnetic field we can make a rough approximation by neglecting the spin-orbit coupling. With this 
approximation the Hamiltonian is diagonal in the \m£,ms) basis and we get 



Emi,m, = (mi + gms)h 



(1347) 



In fact there are two levels that are exact eigensates of the Hamiltonian for any h. These are: 



■jnj—± T 



(1348) 



For a weak magnetic field it is better to write the Hamiltonian in the \j,mj) basis, so as to have L ■ S diagonal, 
while the Zceman terms are treated as a perturbation. We can determine (approximately) the splitting of the j 
multiplets by using degenerate perturbation theory. In order to do so we only need to find the j sub-matrices of 
the the Hamiltonian. We already know that they should obey the Wigner Eckart theorem. By inspection of the 



Hamiltonian we see that this is indeed the case. We have gi 
for J = 5- Hence we can write 



and gs = i for J = §, while gi 



and gs 



E — E 



{gMTnj)h 



(1349) 



where gm = gL + ggs is associated with the vector operator M = L + gS. In order to calculate g^ and gs we do not 
need to calculate the multiplication of the 6x6 matrices. We can simply can use the formulas 



gL 



gs 



jJ-L) ^ ji:j + l)+£i£+l)-s{s + l) 

2j(j + 1) 

3{j + l) + sis + l)-l{£+l) 



jJ-s) 

j{j + 1) 



2j(j + 1) 



(1350) 
(1351) 
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Special Topics 



[50] Quantization of the EM Field 



[50.1] The Classical Equations of Motion 



The equations of motion for a system which is composed of non-relativistic classical particles and EM fields are: 

mj-^^ = CiS - CiB X Xi (1352) 

V-f = 47rp 
dB 

- -Tt 
V-B = 
d£ 

V X B = —+4:71 J 

ot 

where 

p{x) = ei(5(a; — X,;) (1353) 

i 

J{x) = ei-ki5{x - Xi) 

i 

We also note that there is a continuity equation which is implied by the above definition and also can be regarded as 
a consistency requirement for the Maxwell equation: 

| = -V.J (1354) 

It is of course simpler to derive the EM field from a potential (V, A) as follows: 

B = V X A (1355) 

Then we can write an equivalent system of equations of motion 



dt 



rrij— ^ = ei£ - e.iB x x^ (1356) 



VM- — + 47rJ 



df^ dt 
V^V = -Anp 



[50.2] The Coulomb Gauge 



In order to further simplify the equations we would like to use a convenient gauge which is called the " Coulomb gauge" . 
To fix a gauge is essentially like choosing a reference for the energy. Once we fix the gauge in a given reference frame 
("laboratory") the formalism is no longer manifestly Lorentz invariant. Still the treatment is exact. 
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Any vector field can be written as a sum of two components, one that has zero divergence and another that has zero 
curl. For the case of the electric field £, the sum can be written as: 

£=£,,+£i_ (1357) 

where V • = and V x = 0. The field £± is called the transverse or solenoidal or "radiation" component, while 
the field £„ is the longitudinal or irrotational or "Coulomb" component. The same treatment can be done for the 
magnetic field B with the observation that from Maxwell's equations V • i3 = yields B\\ — 0. That means that there 
is no magnetic charge and hence the magnetic field B is entirely transverse. So now we have 

EM field = (£„ , f _L , S) = Coulomb field + radiation field (1358) 

Without loss of generality we can derive the radiation field from a transverse vector potential. Thus we have: 



£» = -Vy (1359) 
dA 

B^V xA 



£± = - 



This is called the Coulomb gauge, which we use from now on. We can solve the Maxwell equation for V in this gauge, 
leading to the potential 

Now we can insert the solution into the equations of motion of the particles. The new system of equations is 



ei£±_ — CiB X Xj (1361) 



VM + 47r J_L 



It looks as if J,, is missing in the last equation. In fact it can be easily shown that it cancells with the term due 
to the continuity equation that relates dtp to W ■ J, and hence dt'^V to J„ respectively. 

^^^^= [50.3] Hamiltonian for the Particles 

We already know from the course in classical mechanics that the Hamiltonian, from which the equations of motion of 
one particles in EM field are derived, is 

W''"' = 77^(p,-e,A(x,))'+e,l/(x,) (1362) 
Ami 

This Hamiltonian assumes the presence of A while V is potential which is created by all the other particles. Once we 
consider the many body system, the potential V is replaced by a mutual Coulomb interaction term, from which the 
forces on any of the particles are derived: 

n-T.^JP.-e.AMf + \j:^ (1363) 
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By itself the direct Coulomb interaction between the particles seems to contradict relativity. This is due to the non- 
invariant way that we had separated the electric field into components. In fact our treatment is exact: as a whole the 
Hamiltonian that we got is Lorentz invariant, as far as the EM field is concerned. 

The factor 1/2 in the Coulomb interaction is there in order to compensate for the double counting of the interactions. 
What about the diagonal terms i ~ jl We can keep them if we want because they add just a finite constant term 
to the Hamiltonian. The "self interaction" infinite term can be regularized by assuming that each particle has a very 
small radius. To drop this constant from the Hamiltonian means that "infinite distance" between the particles is taken 
as the reference state for the energy. However, it is more convenient to keep this infinite constant in the Hamiltonian, 
because then we can write: 

^ = II 7Tir(P' - e,A(x,))' + ^ fs^d^x (1364) 



2mi 8tt 



In order to get the latter expression we have used the following identity: 

\x-x'\ Sir J " ^ ' 

The derivation of this identity is based on Gauss law, integration by parts, and using the fact that the Laplacian of 
l/\x — x'\ is a delta function. Again we emphasize that the integral diverges unless we regularize the physical size of 
the particles, or else we have to subtract an infinite constant that represents the "self interaction". 

== [50.4] Hamiltonian for the Radiation Field 

So now we need another term in the Hamiltonian from which the second equation of motion is derived 



= 47rJj_ (1366) 



dt^ 

In order to decouple the above equations into "normal modes" we write A in a Fourier scries: 

A{x) = ^ ] V Ak (^"^ = ^ ] V Ak,c. e''"e'"" (1367) 
V volume V volume 

where e'^'" for a given fc is a set of two orthogonal unit vectors. If A were a general vector field rather than a transverse 
field, then we would have to include a third unit vector. Note that V • A = is like k ■ Ak = 0- Now we can rewrite 
the equation of motion as 

Ak^o. + i^lAk.a - 47r Jfc,„ (1368) 

where ujk = \k\. The disadvantage of this Fourier expansion is that it does not reflect that A{x) is a real field. In fact 
the Ak^a should satisfy A-k,a = (^fc,a)*- In order to have proper "normal mode" coordinates we have to replace each 
pair of complex Ak^a and A-k,a by a pair of real coordinates A'f, ^ and A'^ ^. Namely 

Afc,„ = -^[A',,„+zA','J (1369) 

We also use a similar decomposition for Jk,a- We choose the l/^/2 normalization so as to have the following identity: 

fj{x)-A{x)dx = Y.Jk.o.Ak.o. = Y.iJio.Alo. + J'UA'U ^ Y.JrQr (1370) 

k,a [k],a r 
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In the sum over degrees of freedom r we must remember to avoid double counting. The vectors k and —k represent 
the same direction which we denote as [k]. The variable ^ is the same variable as A'j, ^, and the variable A"^ ^ 
is the same variable as —A'^ We denote this set of coordinates Qr, and the conjugate momenta as P,.- We see that 
each of the normal coordinates Qr has a "mass" that equals l/(47r) [CGS!!!]. Therefore the conjugate "momenta" are 
Pr = [l/(47r)]Qr, which up to a factor arc just the Fourier components of the electric field. Now we can write the 
Hamiltonian as 



Ti-rad = ^ 



-Pr + ^maSS ■ Uj'^ QI ~ JrQr 



2 ■ mass ^ 2 



(1371) 



where r is the sum over all the degrees of freedom; two independent modes for each direction and polarization. By 
straightforward algebra the sum can be written as 

Tirad = ^ J^^^^ B^)d^x - J J- Ad^x (1372) 

More generally, if we want to write the total Hamiltonian for the particle and the EM field we have to ensure that 
~dTi/dA{x) = J{x). It is not difficult to figure out the the following Hamiltonian is doing the job: 

n = Hp„,..,.. + H.„i„„ + K„„ = y -^(p, - eM(x,))' + ^ /(f' + B'')d'x (1373) 

The term that corresponds to J • ^ is present in the first term of the Hamiltonian. As expected this terms has a dual 
role: on the one hand it gives the Lorcntz force on the particle, while on the other hand it provides the source term 
that drives the EM field. I should be emphasized that the way we write the Hamiltonian is somewhat misleading: 
The Coulomb potential term (which involves £f^) is combined with the "kinetic" term of the radiation field (which 
involves £\). 



= [50.5] Quantization of the EM Field 

Now that wc know the " normal coordinates" of the EM field the quantization is trivial. For each " oscillator" of " mass" 
l/(47r) we can define a and operators such that Qr — (27r/Li;)^/^(a,- + a\.). Since wc have two distinct variables for 
each direction, we use the notations (6,^^) and {c,c^) respectively: 



= A'L = -A'U^ = (c[,], + cf,, J (1375) 



In order to make the final expressions look more elegant we use the following canonical transformation: 

a+ = ^(^ + *c) (1376) 
a- = -^(fe-ic) 

It can be easily verified by calculating the commutators that the transformation from (&, c) to (a_|-,a_) is canonical. 
Also note that b^b + c = a\_a^ + oLa^. Since the oscillators (normal modes) are uncoupled the total Hamiltonian 
is a simple sum over all the modes: 

H = ^ (^fc^l.a^fe," + ^kcl^^Ck^a) = ^ ^^k.al^^ak^a (1377) 
[k],a k,a 
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For completeness we also write the expression for the field operators: 



UJk V 



'^\ak.o. + al^J (f378) 



and hence 



V volume , — V ^k 

k,a 

The eigenstates of the EM field are 

\ni,n2,n3,...,nk,a,...) (1380) 
We refer to the ground state as the vacuum state: 

Ivacuum) = |0, 0,0,0,...) (f38f) 
Next we define the one photon state as follows: 

|one photon state) = aj^^ jvacuum) (1382) 
and we can also define two photon states (disregarding normalization) : 

|two photon state) = a\^a2^kiai |vacuum) (1383) 
In particular we can have two photons in the same mode: 

|two photon state) (O'kiai)'^ |vacuum) (1384) 
and in general we can have photon states or any superposition of such states. 

An important application of the above formalism is for the calculation of spontaneous emission. Let us assume 
that the atom has an excited level Eb and a ground state Ea- The atom is prepared in the excited state, and the 
electromagnetic field is assume to be initially in a vacuum state. According to Fermi Golden Rule the system decays 
into final states with one photon LUk = {Eb — Ea)- Regarding the atom as a point-like object the interaction term is 

W„«„„ « --A{0)-v (1385) 
c 

where v = p/m is the velocity operator. It is useful to realize that vab = i{EB — Ea)xab- The vector D ~ xab is 
known as the dipole matrix element. It follows that matrix element for the decay is 

liukc - 1,A|?^.„_ Ivacuum, S)p = — ^ (-) ' 27rt^fe|£'=^" • Dp (1386) 

volume V c / 

In order to calculate the decay rate we have to multiply this expression by the density of the final states, to integrate 
over all the possible directions of k, and to sum over the two possible polarizations a. 
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[51] Quantization of a many body system 



[51.1] Second Quantization 



If wc regard the electromagnetic field as a collection of oscillators then we call and a raising and lowering operators. 
This is "first quantization" language. But we can also call and a creation and destruction operators. Then it is 
"second quantization" language. So for the electromagnetic field the distinction between "first quantization" and 
"second quantization" is merely a linguistic issue. Rather than talking about "exciting" an oscillator we talk about 
"occupying" a mode. 

For particles the distinction between "first quantization" and "second quantization" is not merely a linguistic issue. 
The quantization of one particle is called " first quantization" . If we treat several distinct particles (say a proton and 
an electron) using the same formalism then it is still " first quantization" . 

If we have many (identical) electrons then a problem arises. The Hamiltonian commutes with "transpositions" of 
particles, and therefore its eigenstates can be categorized by their symmetry under permutations. In particular there 
are two special subspaces: those of states that are symmetric for any transposition, and those that are antisymmetric 
for any transposition. It turns out that in nature there is a "super-selection" rule that allows only one of these two 
symmetries, depending on the type of particle. Accordingly we distinguish between Fermions and Bosons. All other 
sub-spaces are excluded as "non-physical". 

We would like to argue that the "first quantization" approach, is simply the wrong language to describe a system of 
identical particles. We shall show that if we use the "correct" language, then the distinction between Fermions and 
Bosons comes out in a natural way. Moreover, there is no need for the super-selection rule! 

The key observation is related to the definition of Hilbert space. If the particles are distinct it makes sense to ask 
"where is each particle". But if the particles are identical this question is meaningless. The correct question is "how 
many particles are in each site" . The space of all possible occupations is called " Fock space" . Using mathematical 
language we can say that in " first quantization" , Hilbert space is the external product of " one-particle spaces" . In 
contrast to that, Fock space is the external product of "one site spaces". 

When we say "site" we mean any "point" in space. Obviously we are going to demand at a later stage "invariance" of 
the formalism with respect to the choice of one-particle basis. The formalism should look the same if we talk about 
occupation of "position states" or if we talk about occupation of "momentum states". Depending on the context we 
talk about occupation of "sites'' or of "orbitals" or of "modes" or we can simply use the term "one particle states". 

Given a set of orbitals \r) the Fock space is spanned by the basis {\...,nr, ■■■,ns, ...)}. We can define a subspace of all 
particles states 

span^{|...,nr, ...,ns, ...)} (1387) 

that includes all the superpositions of basis states with Ur = N particles. On the other hand, if we use the first 
quantization approach, we can define Hilbert subspaces that contains only totally symmetric or totally anti-symmetric 
states: 



span^{|ri,r2, ...,rAr)} 

The mathematical claim is that there is a one-to-one correspondence between Fock span^Y states and Hilbert span^ 
or span^ states for Bosons and Fermions respectively. The identification is expressed as follows: 



where ri, ...jT^v label the occupied orbitals, P is an arbitrary permutation operator, ^ is -t-1 for Bosons and —1 for 
Fermions, and = N\/{nrlnsl...). We note that in the case of Fermions the formula above can be written as a Slater 



span5{|ri,r2, ...,rAr)} 



(1388) 




(1389) 



p 
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determinant. In order to adhere to the common notations we use the standard representation: 



{Xi, ...,XAf|*) = 



{xi\ri) ■■■ (xilryv) 
{xnVi) ■■■ {xN\rN) 



(1390) 



In particular for occupation of iV = 2 particles in orbitals r and s we get 



^{xi,X2) = --^(v3''(a;i)v?"(x2) - Lp' {xi)if'' {X2)) 



(1391) 



In the following section we discuss only the Fock space formalism. Nowadays the first quantization Hilbert space 
approach is used mainly for the analysis of two particle systems. For larger number of particles the Fock formalism 
is much more convenient, and all the issue of " symmetrization" is avoided. 



[51.2] Raising and Lowering Operators 



First we would like to discuss the mathematics of a single " site" . The basis states | n) can be regarded as the eigenstates 
of a number operator: 



fi\n) ~ n\n) 

/O 0\ 



(1392) 



In general a lowering operator has the property 

a \n) ^ f{n) \n - 1) 
and its matrix representation is: 
/O * \ 



The adjoint is a raising operator: 

a^|n) =/(n + l)|?i + l) 
and its matrix representation is: 
/O 0\ 



(1393) 



(1394) 



(1395) 



(1396) 



0/ 
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By appropriate gauge we can assume without loss of generality that f{n) is real and non- negative, so we can write 



fin) = y/^) (1397) 
From the definition of d it follows that 

d^'aln) = g{n)\n) (1398) 
and therefore 

ata = g(n) (1399) 
There are 3 cases of interest 



• The raising/lowering is unbounded (—oo < n < oo) 

• The raising/lowering is bounded from one side (say < n < oo) 



• The raising/lowering is bounded from both sides (say < n < M) 

The simplest choice for g{n) in the first case is 

g{n) = 1 (1400) 

In such a case a becomes the translation operator, and the spectrum of n stretches from —oo to oo. The simplest 
choice for 17(71) in the second case is 

gin) = n (1401) 

this leads to the same algebra as in the case of an harmonic oscillator. The simplest choice for gin) in the third case 
is 

gin) = iM-n)n (1402) 
Here it turns out that the algebra is the same as that for angular momentum. To sec that it is indeed like that define 

TO = n — — ^ ■ • • ' (1403) 
where s = (A/" — l)/2. Then it is possible to write 

^(to) = s(s + 1) - m(TO + 1) (1404) 

In the next sections we are going to discuss the "Bosonic" case Af = 00 with gin) = n, and the "Fermionic" case 
Af = 2 with gin) = nil — n). Later we are going to argue that these are the only two possibilities that are relevant 
to the description of many body occupation. 
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It is worthwhile to note that the algebra of "angular momentum" can be formally obtained from the Bosonic algebra 
using a trick due to Schwinger. Let us define two Bosonic operators ai and 02, and 

= 4ai (1405) 

The operator moves a particle from site 1 to site 2. Consider how c and operate within the subspace of {N — 1) 
particle states. It is clear that c and act like lowering/raising operators with respect to m = (0^02 — a\ai)/2. 
Obviously the lowering/raising operation in bounded from both ends. In fact it is easy to verify that c and have 
the same algebra as that of "angular momentum" . 



== [51.3] Algebraic characterization of field operators 

In this section we establish some mathematical observations that we need for a later reasoning regarding the classifi- 
cation of field operators as describing Bosons or Fermions. By field operators we mean either creation or destruction 
operators, to which we refer below as raising or lowering (ladder) operators. We can characterize a lowering operator 
as follows: 

n(ji \ti) ^ = (71 — l)^a |7i) ^ for any n (1406) 
which is equivalent to 

7ia = a(7i-l) (1407) 

A raising operator is similarly characterized by ha^ = a^(fi + 1). 
It is possible to make a more interesting statement. Given that 

[a,a^] = aa'-a^'a = 1 (1408) 

we deduce that a and are lowering and raising operators with respect to n = a^a. The prove of this statement 
follows directly form the observation of the previous paragraph. Furthermore, from 

||a|n)|| = {n\a''a\n) = n (1409) 
||oV>ll = (n|aaV> = l + « (1410) 



Since the norm is a non-negative value it follows that n = and hence also all the positive integer values n = 1,2, 
form its spectrum. Thus in such case a and describe a system of Bosons. 
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Let us now figure our the nature of an operator that satisfies the analogous anti-commutation relation: 

[a,a^]+ = aa^+a^a = 1 (1411) 

Again we define n ~ a^a and observe that a and are characterized by ha ~ a{l ~ n) and na^ ~ a^(l — h). Hence 
we deduce that both a and simply make transposition of two n states |e) and |1— e). Furthermore 

||a|n)|| = (n\a'a\n) = n (1412) 
||aV>ll = {n\aa^\n) = 1-n (1413) 

Since the norm is a non-negative value it follows that < e < 1. Thus we deduce that the irreducible representation 
of a is 



^/^' 




(1414) 



One can easily verify the the desired anti-commutation is indeed satisfied. We can always re-define n such that n=0 
would correspond to one eigenvalue and n=l would correspond to the second one. Hence it is clear that a and 
describe a system of Fcrmions. 

== [51.4] Creation Operators for "Bosons" 



For a "Bosonic" site we define 

a\n) = ^/Tl\n - \) (1415) 

hence 

|n) = V^T+T |n + 1) (1416) 

and 

{a,a^\ = aa^ -a^a = \ (1417) 
If we have many sites then we define 

al^\®\®---®a' ®---®\ (1418) 
which means 

a]! |ni, 712, . . . , rir, . . . ) = V^r + 1 1^1, 7l2, . . . , 71,. + 1, . . . ) (1419) 

and hence 

[a,.,as]= (1420) 

and 

[a,,al] =(5,,, (1421) 
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Wc have defined our set of creation operators using a particular one-particle basis. What will happen if we switch to 
a different basis? Say from the position basis to the momentum basis? In the new basis we would like to have the 
same type of "occupation rules", namely, 



(1422) 



Let's see that indeed this is the case. The unitary transformation matrix from the original \r) basis to the new la) 
basis is 

Tr.^ = {r\a) (1423) 
Then we have the relation 

H=^|r)(r|a)=^r,,Jr) (1424) 

r r 

and therefore 

^ J2^r,aal (1425) 

r 

Taking the adjoint wc also have 

a„ = ^T;*,a, (1426) 

r 

Now WC find that 

4] = E [^raar, Ts^al] = r;„T,0<5,,, = (Tt)^^, Trp = {T^T)^^ = (1427) 

r.s 

This result shows that and ajj are indeed destruction and creation operators of the same "type" as and a\.. Can 
we have the same type of invariance for other types of occupation? We shall sec that the only other possibility that 
allows "invariant" description \s M = 2. 

== [51.5] Creation Operators for "Fermions" 

In analogy with the case of a " Boson site" we define a " Fermion site" using 

a|n)=V^|n-l) (1428) 

and 

\n) = V'l + 1 1^ + 1) with mod(2) plus operation (1429) 
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The representation of the operators is, using PauH matrices: 

0\ 1 , . . 

1 0) = 9j'''-'^^^ 

= ( n n ) = ^ ('^1 +*'^2) 



o; 2 

aa} = i — n 

[a, a^] ^ = aa^ + a^a = 1 

while 

[a, a'] = 1 - 2n (1431) 

Now we would like to proceed with the many-site system as in the case of " Bosonic sites" . But the problem is that 
the algebra 

[ttr, al] = (5,r,s(l - 2alar) (1432) 

is manifestly not invariant under a change of one-particle basis. The only hope is to have 

[ar,al]^^Sr,s (1433) 

which means that and for r ^ s should anti-commute rather than commute. Can we define the operators in 
such a way? It turns out that there is such a possibility: 

a];|ni,n2,...,n^,...) = (-1)^=0'-' "Vl + n2, . . . , + 1, . . . ) (1434) 

For example, it is easily verified that we have: 

4al|0, 0,0,...) = -4410,0,0,...) = |1,1,0,...) (1435) 

With the above convention if we create particles in the "natural order" then the sign comes out plus, while for any 
"violation" of the natural order we get a minus factor. 



== [51.6] One Body Additive Operators 

Let us assume that we have an additive quantity Y which is not the same for different one-particle states. One 
example is the (total) kinetic energy, another example is the (total) potential energy. It is natural to define the many 
body operator that corresponds to such a property in the basis where the one-body operator is diagonal. In the case 
of potential energy it is the position basis: 

y = ^ VoL^ana = ^ 4^a,aaa (1436) 



This counts the amount of particles in each a and multiplies the result with the value of V at this site. If we go to a 
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different onc-particle basis then we should use the transformation 

a^ = Y.Ko.^k (1437) 

A: 
k' 

leading to 

V = Y,al,Vk' ,kak (1438) 
fc.fe' 



Given the above result we can calculate the matrix elements from a transition between two different occupations: 

~ l,n2 + l|y|ni, 712)1' = ("2 + 1) ni |1^2,i|' (1439) 

What we get is quite amazing: in the case of Bosons we get an amplification of the transition if the second level is 
already occupied. In the case of Fermions we get "blocking" if the second level is already occupied. Obviously this 
goes beyond classical reasoning. The latter would give merely ni as a prefactor. 



[51.7] Two Body "Additive" Operators 



It is straightforward to make a generalization to the case of two body "additive" operators. Such operators may 
represent the two-body interaction between the particles. For example we can take the Coulomb interaction, which 
is diagonal in the position basis. Thus we have 

1 . 1 ■ 

[/ = - ^ Uap.aliflafllB + 2 X! {fla ~ 1) (1440) 

a^P a 

Using the relation 

- _ 1) for a = 2 ^1441) 

We get the simple expression 

U = Uap,al3 apCLa (1442) 

a, 13 

and for a general one-particle basis 

U^\Y1 Uk'VM aidk (1443) 

k'l',kl 

We call such operator "additive" (with quotations) because in fact they are not really additive. An example for 
a genuine two body additive operator is where A and B are one body operators. This observation is very 

important in the theory of linear response (Kubo). 



237 



== [51.8] Matrix elements with particle states 

Consider an N particle state of a Ferniionic system, which is characterized by a definite occupation of k orbitals: 



\Rn) = al 



N 



I|o> 



(1444) 



For the expectation value of a one body operator we get 



{Rn\V\Rn) - J2 



(1445) 



because only the terms with fc = fc' do not vanish. If we have two N particle states with definite occupations, then the 
matrix element of V would be in general zero unless they differ by a single electronic transition, say from an orbital 
kg to another orbital fcg. In the latter case we get the result Vk'^^kg if the other electrons are not involved. 

For the two body operator we get for the expectation value a more interesting result that goes beyond the naive 
expectation. The only non-vanishing terms in the sandwich calculation are those with cither k' ~ k and I' = I ot with 
k' = I and /' = k. All the other possibilities give zero. Consequently 



{Rn\U\Rn) ^\Y1 MU\kl) ~ {lk\U\kl) 

exchange 



(1446) 



A common application of this formula is in the context of multi-electron atoms and molecules, where U is the 
Coulomb interaction. The direct term has an obvious electrostatic interpretation, while the exchange term reflects the 
implications of the Fermi statistics. In such application the exchange term is non- vanishing whenever two orbitals have 
a non-zero spatial overlap. Electrons that occupy well separated orbitals have only a direct electrostatic interaction. 



[51.9] Introduction to the Kondo problem 



One can wonder whether the Fermi energy, due to the Pauli exclusion principle, is like a lower cutoff that "regularize" 
the scattering cross section of of electrons in a metal. We explain below that this is not the case unless the scattering 
involves a spin flip. The latter is known as the Kondo effect. The scattering is described by 



V 



k'k 



(1447) 



hence: 



jn[2] 



V- 



E-H + iO 



:V 



ki) = 



{k2 al,Vkik,ak,^—^^-^al,Vk'^M^ak^ (1448) 



where both the initial and the flnal states are zero temperature Fermi sea with one additional electron above the 
Fermi energy. The initial and flnal states have the same energy: 



(1449) 



where Eq is the total energy of the zero temperature Fermi sea. The key observation is that all the intermediate 
states are with definite occupation. Therefore we can pull out the resolvent: 



y[2] ^ 



.Kk 



Vk(,kH^k'.ka 



E~ E, 



ka,k' 



fci 



(1450) 



where 
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ka,k' 



(1451) 



Energy levels diagram 
k = represeneted by vertical shift 

I 1 



Feynman diagram 
k = represeneted by direction 




time 



As in the calculation of "exchange" we have two non-zero contribution to the sum. These are illustrated in the figure 
above: Either (fc^, fcf,, /c^, ka) equals (fc2, k' , fc', fci) with fc' above the Fermi energy, or (fc', fci, fc2, k') with fc' below the 
Fermi energy. Accordingly E — Ek^^k'^ equals either (e^^ — et') or — (e^^ — e^/). Hence we get 



y[2] = 



Vk2,k'Vk'.ki 



al^ak'ttlak. 



ki 



Vk'.kiVk2,k' 

^(cfci - ek') + iO 



alittkial^Uk' 



ki 



(1452) 



Next we use 
^k. 



al^ak'ttluk. 



fci) = ^^2 43(1 ~ nk')ak^ fci) = +1 X (^k 



al^ak, 



fci 



(1453) 



which holds if k' is above the Fermi energy (otherwise it is zero). And 



al,ak,al^ak' 



fci) = (k 



fci ) = -1 X (fc 



al^Qk, 



fci 



(1454) 



which holds if fc' is below the Fermi energy (otherwise it is zero) . Note that without loss of generality wc can assume 
gauge such that {k2\al,^aki |fci) = 1 Coming back to the transition matrix we get a result which is not divergent at the 
Fermi energy: 



E 



Vk2,k'Vk\ki 



E 

k'^hclo 



Vk2,k'Vk',ki 



(1455) 



If we arc above the Fermi energy, then it is as if the Fermi energy does not exist at all. But if the scattering involves 
a spin flip, as in the Kondo problem, the divergence for e close to the Fermi energy is not avoided. Say that wc want 
to calculate the scattering amplitude 



(fc2T,4|r|fci,T,^> 



(1456) 



where the double arrow stands for the spin of a magnetic impurity. It is clear that the only sequences that contribute 
are those that take place above the Fermi energy. The other set of sequences, that involve the creation of an electron- 
hole pair do not exist: Since we assume that the magnetic impurity is initially "down", it is not possible to generate 
a pair such that the electron spin is " up" . 
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== [51.10] Green functions for many body systems 

The Green function in the one particle formahsm is defined via the resolvent as the Fourier transform of the propagator. 
In the many body formalism the role of the propagator is taken by the time ordered correlation of field operators. 
In both cases the properly defined Green function can be used in order to analyze scattering problems in essentially 
the same manner. It is simplest to illustrate this observation using the example of the previous section. The Green 
function in the many body context is defined as 



G'fc2.fei(e) 



-iFT 



{^\TakAt2)alitip) 



(1457) 



If "I" is the vacuum state this coincides with the one particle definition of the Green function: 



(1458) 



But if 'J is (say) a non-empty zero temperature Fermi sea then also for t2 < ti we get a non-zero contribution due to 
the possibility to annihilate an electron in an occupied orbital. Thus we get 



Cfc>CF 



f-k + iO 



E 

efc<«F 



5ki,k5k2,k 

e- ek -iO 



(1459) 



One should observe that the many-body definition is designed so as to reproduce the correct T matrix as found in 
the previous section. The definition above allows us to adopt an optional point of view of the scattering process: a 
one particle point of view instead of a many body point of view! In the many body point of view an electron-hole pair 
can be created, and later the hole is likely to be annihilated with the injected electron. In the one particle point of 
view the injected electron can be "reflected" to move backwards in time and then is likely to be scattered back to the 
forward time direction. The idea here is to regard antiparticlcs as particles that travel backwards in time. This idea 
is best familiar in the context of the Dirac equation. 
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[52] Wigner function and Wigner-Weyl formalism 
== [52.1] The classical description of a state 



Classical states are described by a probability function. Given a random variable x, we define p(x) — Prob(a; = x) as 
the probability of the event x = where the possible values of x are called the spectrum of the random variable x. For 
a random variable continuous spectrum we define the probability density function via p{x)dx = Prob(a; < x < x + dx). 
The expectation value of a random variable A = A{x) is defined as 

(i) =^p(.t) A(x) (1460) 

and for a continuous variable as. For simplicity we use, from now on, a notation as if the random variables have a 
discrete spectrum, with the understanding that in the case of a continuous spectrum we should replace the ^ by an 
integral with the appropriate measure (e.g. dx or dp / {2-kK)) . Unless essential for the presentation we set h = 1. 

Let us consider two random variables x,p. One can ask what is the joint probability distribution for these two 
variables. This is a valid question only in classical mechanics. In quantum mechanics one usually cannot ask this 
question since not all variables can be measured in the same measurement. In order to find the joint probability 
function of these two variables in quantum mechanics one needs to build a more sophisticated method. The solution 
is to regard the expectation value as the fundamental outcome and to define the probability function as an expectation 
value. In the case of one variable such as x or p, we define probability functions as 

p(X) = ( S{x - X) ) (1461) 
p{P) = {2^5{p-P) ) 

Now we can also define the joint probability function as 

p{X,P) = {2T:5{jj-P) 5{x-X)) (1462) 

This probability function is normalized so that 

j p{X,P)^^ = (^l 5{p-P)dP J 5{x-X)dX^^l (1463) 

In the next section we shall define essentially the same object in the framework of quantum mechanics. 

== [52.2] Wigner function 

Wigner function is a real normalized function which is defined as 

p„{X,P) = (\2^5{p-P)5{x-X)] ) (1464) 

In what follows we define what we mean by symmetrization ("sym"), and we relate pyj(X,P) to the conventional 
probability matrix p{x\x"). We recall that the latter is defined as 



pix',x")^(^P^^ ) = (^\x"){x'\ ) (1465) 
The "Wigner function formula" that we are going to prove is 



Pw(^,P)= J p(^X + K,X-^i 



r ] c-'P'^dr (1466) 
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Thus to go from the probabihty matrix to the Wigner function is merely a Fourier transform, and can be loosely 
regarded as a change from "position representation" to "phase space representation" . 

Moreover we can use the same transformation to switch the representation of an observable A from A{x',x") to 
A{X, P) . Then we shall prove the "Wigner- Weyl formula" 

traceiAp)^ J AiX,P)p„{X,P)^^ (1467) 

This formula implies that expectation values of an observable can be calculated using a semi-classical calculation. 
This extension of the Wigner function formalism is known as the Wigner- Weyl formalism. 

^^^^= [52.3] Mathematical derivations 



Fourier transform reminder: 



F{k) = J f{x)c-"'''dx (1468) 
Inner product is invariant under change of representation 

/rlk 
—F{k)G*{k) (1469) 

For the matrix representation of an operator A we use the notation 

A{x\x") = {x'\A\x") (1470) 
It is convenient to replace the row and column indexes by diagonal and off-diagonal coordinates: 

= the diagonal coordinate (1471) 
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and to use the alternate notation 

A 



X-\r) (1472) 



AiX,r)^(x + \r.^.. 2 
Using this notation, the transformation to phase space representations can be written as 



AiX,P) = J AiX,r)e-'^''dr (1473) 



Note that if A is hermitian then A{X, —r) — A*{X, +r), and consequently A{X, P) is a real function. Moreover, the 
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trace of two hermitian operators can be written as a phase space integral 



trace(li3) = j A{x' ,x")B{x\x')dx' dx" ^ j A{x' ,x")B* {x' ,x")dx' dx" 
A{X,+r)B*{X,r)dXdr = / A{X, P)B{X, P)'- 



(1474) 



dXdP 



This we call the " Wigner- Weyl formula" . 

For the derivation of the "Wigner function formula" we have to show that 5{p — P)S{x — X) form a complete set of 
observables that are linearly related to the standard set \x'){x"\. First we cite an helpful identity: 



\X){X\ ^ S{x-X) = I 



-X) 



(1475) 



In order to prove the first part of this identity it is easier to use discrete notation, |no)(no| = 6n,no- The left hand 
side is a projector whose only non-zero matrix element is n = m = no. The right hand side is a function of fiji) 
such that f{n) ~ 1 for n = tiq and zero otherwise. Therefore the right hand side is also diagonal in n with the same 
representation. Now we can write 



x'){x"\ = X-{rl2)^(x+{r/2) = e^('^/2)p|^^ ^^|pz(r/2)j5 ^ S^/^)p5{x ~ Xy'^''/'^'^^ 



!^Qi{r/2)p^ip{x-X)^t{r/2)p ^ f ^-P 



2tt 



^ irp+ip{x-X) 



(1476) 
(1477) 



where in the last step we have used the identity e^^^ ~ e"^e^e2 ["^^^1 , and the commutation = i, in order to join 
the exponents as in the equality qvI^^x^pIi ^ gx+p^ ^j^j-^ ^j^^ 

same spirit we define the symmetrized operator 



27r(5(p - P) 8(x - X) 



drdp 
2n 



^ir{p—P)-\-ip{x — X) 



(1478) 



The derivation of the Wigner function formula follows in a straightforward fashion from the implied linear relation 



2tt6{p - P) 6{x - X) 



dre 



-irP 



X-{r/2))l^X+{r/2) 



(1479) 



=== [52.4] Applications of the Wigner Weyl formalism 

In analogy with the definition of the Wigner function p„{X^P) which is associated with p we can define a Wigner- 
Weyl representation A^fj^{X, P) of any hermitian operator A. The phase space function Aww(^j P) is obtained from 
the standard representation A{x' , x") using the same "Wigner function formula" recipe. Let us consider the simplest 
examples. First consider how the recipe works for the operator x: 



{x\x\x") ~ x'6{x'—x") = X5{r) X 



(1480) 
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Similarly p ^^-^ P. Further examples are: 

fix) ^ f{X) (1481) 
g{p) ™ 9iP) (1482) 
xp ™ XP+^i (1483) 

px ™ ^P-l^ (1484) 
i(ip + pi) ™ (1485) 

In general for appropriate ordering we get that f{x,p) is represented by f{X, P). But in practical applications f{x,p) 
will not have the desired ordering, and therefore this recipe should be considered as a leading term in a semiclassical 
h expansion. 

There are two major applications of the Wigner Weyl formula. The first one is the calculation of the partition function. 

Z{(3) = Yl = E (He-^^|r) = ^306(0"^^^) = / ^^(c-^h(x,p)^ + o{h) (1486) 

r r 

The second one is the calculation of the number of eigenstates up to a given energy E 

N{E) = l = ^e(£:-£;^) = trace[e(£:-H)] (1487) 

Er<E r 



iH(x.P)<E 27r 

Below we discuss some further applications that shed light on the dynamics of wavepackets, interference, and on the 
nature of quantum mechanical states. We note that the time evolution of Wigner function is similar but not identical 
to the evolution of a classical distribution unless the Hamiltonian is a quadratic function of x and p. 

== [52.5] Wigner function for a Gaussian wavepacket 



A Gaussian wavepacket in the position representation is written as 

*(a-) = ^ p-^^^if^p'Pn^ (1488) 
V -\/27r(T 

The probability density matrix is 

p{X,r) = ( X+ir ) ( X-ir ) (1489) 



2 y V 2 , 

n TT^ 1^2 +^Por ^ „ — — T-y+'PiH- 



V27rcr VSttct 
Transforming to the Wigner representation 

Pw(^,-P) = / 5^^^ dr = e ^ (1490) 
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where Gr^ = a and Cp = It follows that cr^Cp = 1/2. Let us no go backwards. Assume that have a Gaussian in 
phase space, which is characterized by some and Cp. Does it represent a legitimate quantum mechanical state? 
The normalization condition trace(yo) = 1 is automatically satisfied. We also easily find that 

trace(p2) ^ / e < "5 ^— = (1491) 

Wc know that trace(/9^) = 1 implies pure state. If trace{p^) < 1 it follows that wc have a mixed state, whereas 
trace(p^) > 1 is not physical. It is important to remember that not any p(X,P) corresponds to a legitimate quantum 
mechanical state. There are classical states that do not have quantum mechanical analog (e.g. point like preparation). 
Also the reverse is true: not any quantum state has a classical analogue. The latter is implied by the possibility to 
have negative regions in phase space. These is discussed in the next example. 



^^^^= [52.6] The Winger function of a bounded particle 

Wigncr function may have some modulation on a fine scale due to an interference effect. The simplest and most 
illuminating example is the Wigner function of the nth eigenstate of a particle in a one dimensional box (0 < x < L). 
The eigen-wavefunction that correspond to wavenumber k ~ {t^/L) x integer can be written as the sum of a right 
moving and a left moving wave ipi^) = (l/v^)('0i(x) -\- ijj2{x)) within Q < x < L, and ipi^) ~ otherwise. The 
corresponding Wigncr function is zero outside of the box. Inside the box it can be written as 

Pw(^,P) = ipi(X,P) + ip2(^,P)+Pi2(^,P) (1492) 

where pi2 is the interference component. The semiclassical components are concentrated at P = ±fc, while the 
interference component is concentrated at P = 0. The calculation of pi{X,P) in the interval Q < x < L/2 is 
determined solely by the presence of the hard wall at x = 0. The relevant component of the wavefunction is 

■<Pi{x) = ^e(a;)e''=^ (1493) 
yL 

and hence 

/OO 1 /"OO 

V'i(X + {rl2))ri{X ~ (r/2))e~'P'-dr = - Q{X + (r/2))e(X - (r/2))e-^(^-'^''-dr 
-OO ^ J — OO 

^.^P_k]r^^ = smc{2X{P-k)) (1494) 



L J-2X L 

This shows that as we approach the sharp feature the non-classical nature of Wigner function is enhanced, and the 
classical (delta) approximation becomes worse. The other components of Wigner function are similarly calculated, 
and for the interference component wc get 

4a: 

pi2 (X, P) = -2 cos(2fcA:) X — sinc(2A:P) (1495) 

It is easily verified that integration of p„{X,P) over P gives p{x) = 1 + 1 — 2cos(2fcAr) = 2(sin(fcAr))^. 

In many other cases the energy surface in phase space is "soft" (no hard walls) and then one can derive a uniform 
semiclassical approximation [Berry, Balazs]: 



^-(-"'^^ = a::(x:p)^-( Liip) 



(1496) 
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where for H. = p'^ / {2m) + V{x) 



1 



( -\vv{x)\^ + \iP- v)V(x) 

' m 



ll/3 

(1497) 



What can we get out of this expression? We see that p„{X,P) decays exponentially as we go outside of the energy 
surface. Inside the energy surface we have oscillations due to interference. 

The interference regions of the Wigncr function might be very significant. A nice example is given by Zurek. Let 
us assume that we have a superposition of iV 3> 1 non-overlapping Gaussian, we can write the Wigner function as 
p = {i/N)J2Pj + Pintrtr- Wc havc trace(p) ~ 1 and also trace(p^) = 1. This implies that trace(pi„trtr) = 0, while 
trace(pf,jj^,.J ^ 1. The latter conclusion stems from the observation that the classical contribution is x <c 1. 

Thus the interference regions of the Wigner function dominate the calculation. 



== [52.7] The Winger picture of a two slit experiment 

The textbook example of a two slit experiment will be analyzed below. The standard geometry is described in the 
upper panel of the following figure. The propagation of the wavepacket is in the y direction. The wavepacket is 
scattered by the slits in the x direction. The distance between the slits is d. The interference pattern is resolved on 
the screen. In the lower panel the phase-space picture of the dynamics is displayed. Wigncr function of the emerging 
wavepacket is projected onto the (x,px) plane. 




Px 



Ax 



X 



The wavepacket that emerges from the two slits is assumed to be a superposition 



(1498) 



The approximation is related to the normalization which assumes that the slits are well separated. Hence we can 
regard fi{x) ~ faix + {d/2)) and 'P2{x) ~ faix — {d/2)) as Gaussian wavepackets with a vanishingly small overlap. 
The probability matrix of the superposition is 



P{X',X")=^{X'W{X") = (^l(x')+^2(x-'))(^*l(x") + ^2(^")) = ^Pl + ^P2+A„we. 



(1499) 
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All the integrals that are encountered in the calculation are of the Wigncr function are of the type 

^0 [{X - Xo) + i(r - ro)^ [{X - Xo) - ^{r - r„)^ e'^P^ dr = po{X ~ Xo,P) e"^^'"" (1500) 

where Xq = zLd/2 and Tq = for the classical part of the Wigner function, while = and tq = ±d/2 for the 
interference part. Hence we get the result 

Pw(X,P) = ^po(^X + ^,P^ + ^po(^X -^,P^ + cos{Pd) po{X,P) (1501) 

Note that the momentum distribution can be obtained by integrating over X 

Pd 

p{P) = (l + cos(Pd))po(P) = 2cos2( — )po(P) (1502) 

In order to analyze the dynamics it is suggestive to write p{X^ P) schematically as a sum of partial-wavepackcts, each 
characterized by a different transverse momentum: 

oo 

p^{X,P) = PniX,P) (1503) 

n— — oo 

By definition the partial-wavepacket p„ equals p for \P — n x {2T:h/d)\ < nh/d and equals zero otherwise. Each 
partial wavepacket represents the possibility that the particle, being scattered by the slits, had acquired a transverse 
momentum which is an integer multiple of 

Ap = ^ (1504) 
d 

The corresponding angular separation is 

^ - ^ (1505) 

as expected. The associated spatial separation if we have a distance y from the slits to the screen is Aa; = Ady. It is 
important to distinguish between the "preparation" zone y < d, and the far- field (Franhaufer) zone y ^ d^/Xs- In 
the latter zone we have Ax ^ d or equivalently AxAp ^ 2TTh. 



[52.8] Thermal states 



A stationary state dp/dt has to satisfy [7i, p] = 0. This means that p is diagonal in the energy representation. It can 
be further argued that in typical circumstances the thermalized mixture is of the canonical type. Namely 

p = \^)Pr'^A = I E k>c-^'^'-(r-| = ^e-^^ (1506) 

Let us consider some typical examples. The first example is spin 1/2 in a magnetic field. In this case the energies are 
i?l = e/2 and — e/2. Therefore p takes the following form: 

.= ^^(f (1507) 



2cosh(i^e) V e-''* 
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Optionally one can represent the state of the spin by the polarization vector 

M = (o,0,tanh(i/3e)) (1508) 

The next example is a free particle. The Hamiltonian is H = fp' /2m. The partition function is 



dk f dXdP r,P^ { m \ '- , , 

-c-P^. = I — c-l^^ = L — — (1509) 



(27r)/L J 2-K 

The probability matrix p is diagonal in the p representation, and its Wigner function representation is identical 
with the classical expression. The standard x representation can be calculated either directly or via inverse Fourier 
transform of the Wigner function: 

p{x',x") = (^i^ c-^["'-""l' (1510) 

Optionally it can be regarded as a special case of the harmonic oscillator case. In the case of an harmonic oscillator 
the calculation is less trivial because the Hamiltonian is not diagonal neither in the x nor in the p representation. 
The eigcnstates of the Hamiltonian are H\n) = En\n) with En = + n)uj. The probability matrix is 

PnW = 45„«'C~''"(^+") (1511) 

where the partition function is 

Z = ^c--^^" = (2s\nh(h3uX\ (1512) 



In the X representation 

p{x\x") = ^(.T'|n)p„(n|.T") = ^p„^"(x')¥'"(a:") (1513) 

n n 

The last sum can be evaluated by using properties of Hermite polynomials, but this is very complicated. A much 
simpler strategy is to use of the Feynman path integral method. The calculation is done as for the propagator 
{x'\ exp{—iHt)\x") with the time t replaced by — z/3. The result is 

pix',x") cx c~ ^""m/^") ["°"''(^"H(^"'+^")-2^'^"] ^^5^4^ 

which leads to the Wigner function 



Pw(X,P) e \ ^ (1515) 



It is easily verified that in the zero temperature limit we get a minimal wavepacket that represent the pure ground 
state of the oscillator, while in high temperatures we get the classical result which represents a mixed thermal state. 



[53] Theory of quantum measurements 
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[53.1] The reduced probability matrix 



In this section we consider the possibihty of having a system that has interacted with its surrounding. So we have 
"system (g) environment" or "system <Si measurement device" or simply a system which is a part of a larger thing 
which we can call "universe" . The question that we would like to ask is as follows: Assuming that we know what is 
the sate of the "universe" , what is the way to calculate the state of the "system" ? 

The mathematical formulation of the problem is as follows. The pure states of the "system" span N^^^ dimensional 
Hilbert space, while the states of the "environment" span N^^^ dimensional Hilbert space. So the state of the "universe" 
is described hy N x N probability matrix Piaj/s, where N = N^y^N^,„. This means that if we have operator A which 
is represented by the matrix Aiajfj, then it expectation value is 

(A) = trace{Ap) = ^ Aa,ji3 Pji3,ia (1516) 

The probability matrix of the " system" is defined in the usual way. Namely, the matrix clement pjl is defined as the 
expectation value of P^^ = \i){j\®l. Hence 

= (^'^'> = trace(P^V) = ^ PL,ipPWMc.^ ^ 5kJLJ5c.,f}Pl|3Mo,^^Pja,^c, (1517) 

The common terminology is to say that p''^' is the reduced probability matrix, which is obtained by tracing out the 
environmental degrees of freedom. Just to show mathematical consistency we note that for a general system operator 
of the type A — A'^" ® 1°'™ we get as expected 

{A) = trace(A/7) = ^ A.^^.pp.p,,^ ^ H^'^}" = trace(A->=-) (1518) 



== [53.2] Entangled superposition 

Of particular interest is the case where the universe is in a pure state Choosing for the system (g) environemnt an 
arbitrary basis \ia) = \i) (g) \a), we can expand the wavefunction as 

l^-) = ^*.aKa) (1519) 

i,a. 

By summing over a we can write 



where Ix^'-*) ^'iaja) is called the "relative state" of the environment with respect to the ith state of the system, 

while Pi is the associated normalization factor. Note that the definition of the relative state implies that — \/Vj^a ■ 
Using these notations it follows that the reduced probability matrix of the system is 

P3I = E*J"*» = ix^'^lx^'^) (1521) 
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The prototype example for a system-environment entangled state is described by the superposition 

I*) = |2)®|X<'^) (1522) 

where |1) and |2) are orthonormal states of the system. The singlet state of two spin 1/2 particles is possibly the 
simplest example for an entangled superposition of this type. Later on we shall see that such entangled superposition 
may come out as a result of an interaction between the system and the environment. Namely, depending on the state 
of the system the environment, or the measurement apparatus, ends up in a different state x- Accordingly we do not 
assume that x^^^ and x^^'' a-re orthogonal, though we normalize each of them and pull out the normalization factors 
as pi and p2 ■ The reduced probability matrix of the system is 



f Pi ^*y/PlP2\ 

\\^/pip2 P2 J 



(1523) 



where A = (x'^'lx^^^). At the same time the environment is in a mixture of non-orthogonal states: 

= Pi \X^'^){X^'^\+P2 Ix^'^Xx^'^l (1524) 

The purity of the state of the system in the above example is determined by |A|, and can be characterized by 
trace(p^) = 1 — 2pip2{l—\X\^). The value trace(p^) = 1 indicates a pure state, while trace(p^) = 1/A^ with N = 2 
characterizes a 50%-50% mixture. Optionally the purity can be characterized by the Von Neumann entropy as 
discussed in a later section: This gives S[p] = for a pure state and S[p] = log(iV) with N = 2 for a 50%-50% 
mixture. 

^^^= [53.3] Schmidt decomposition 



If the "universe" is in a pure state we cannot write its p as a mixture of product states, but we can write its as an 
entangled superposition of product states. 

I*) = ^\^I*>«)|S«) (1525) 

i 

where the \Bi) is the "relative state" of subsystem B with respect to the zth state of subsystem A, while pi is the 
associated normalization factor. The states \Bi) are in general not orthogonal. The natural question that arise is 
whether we can find a decomposition such that the \Bi) are orthonormal. The answer is positive: Such decomposition 
exists and it is unique. It is called Schmidt decomposition, and it is based on singular value decomposition (SVD). 
Let us regard — Wi^a as an Na x Nb matrix. From linear algebra it is known that any matrix can be written in 
a unique way as a product: 

W(N^y(NB) = ^(Na-xNa)-^{'^a><Nb)^(Nb-x.Nb) (1526) 

where and are the so called left and right unitary matrices, while Z? is a diagonal matrix with so called 
(positive) singular values. Thus we can re-write the above matrix multiplication as 

^Y.^t^/P'r U^,^ (1527) 

r 

Substitution of this expression leads to the result 

l*> = Yl VP'Mr) ® \Br) (1528) 
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where \Ar) and \Br) are implied by the unitary transformations. We note that the normahzation of ^ impUcs 
J2Pr = 1- Furthermore the probabihty matrix is pfj'fp = Wi,aW* and therefore the calculation of the reduced 
probability matrix can be written as: 

pA ^ ^ [U^)D''{U^)^ (1529) 

This means that the matrices and have the same non-zero eigenvalues {pr}, or in other words it means that 
the degree of purity of the two subsystems is the same. 



^^^^= [53.4] The violation of Bells's inequality 

We can characterize the entangled state using a correlation function as in the EPR thought experiment. The correlation 
function C{0) = (C) is the expectation value of a so-called "witness operator" . If we perform the EPR experiment 
with two spin 1/2 particles (see the Fundamentals II section), then the witness operator is C — (S) <Jg, and the 
correlation function comes out C{6) ~ —cos{9), which violates Bell inequality. 

Let us see how the result for C{9) is derived. One possibility is to perform a straightforward calculation using explicit 
representation for the assumed singlet state, and for the witness operator in the standard basis: 



m = ^{Mi-{aoh), (1530) 



leading to the desired result. There is another way to derive this result, which is more physically illuminating. The 
idea is to relate correlation functions to conditional calculation of expectation values. Let A = |ao) (ao| be a projector 
on state oq of the first subsystem, and let B some observable which is associated with the second subsystem. We can 
write the state of the whole system as an entangled superposition 

a 

Then it is clear that (A (g) B) = Pao{x'""'°^\B\x'''°'°^) ■ More generally if A = J^a \a)a{a\ is any operator then 

(A^B) = « (^^"^I^IX^"^) (1532) 

a 

Using this formula with j4 = = | t) (T I ^ I i) (i L a-nd B ^ ag we have = Pi = 1/2, and we get the same result 
for C{9) as in the previous derivation. 

^^^^= [53.5] Purity and the Von Neumann entropy 

The purity of a state can be characterized by the Von Neumann entropy: 

S[p] = —trace{p\ogp) = — ^p^fogPr (1533) 
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In the case of a pure state we have S[p] = 0, while in the case of a uniform mixture of states we have S[p] = log(A^). 
From the above it should be clear that while the "universe" might have zero entropy, it is likely that a subsystem 
would have a non-zero entropy. For example if the universe is a zero entropy singlet, then the state of each spin is 
unpolarized with log(2) entropy. 
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We would like to emphasize that the Von Neumann entropy S[p\ should not be confused with the Boltzmann entropy 
S'[p|^]. The definition of the latter requires to introduce a partitioning A of phase space into cells. In the quantum 
case this "partitioning" is realized by introducing a complete set of projectors (a basis). The pr in the case of the 
Boltzmann entropy are probabilities in a given basis and not eigenvalues. In the case of an isolated system out of 
equilibrium the Von Neumann entropy is a constant of the motion, while the appropriately defined Boltzmann entropy 
increases with time. In the case of a canonical thermal equilibrium the Von Neumann entropy S[p\ turns out to be 
equal to the thermodynamic entropy S. The latter is defined via the equation dQ ~ TdS, where T = 1//3 is an 
integration factor which is called the absolute temperature. 

If the Von Neumann entropy were defined for a classical distributions p ~ {Pr}, it would have all the classical "in- 
formation theory" properties of the Shanon entropy. In particular if we have two subsystems A and B one would 
expect 

S[p% S[p-] < S[p--] < S[p-]+S[p-] (1534) 

This property is also satisfied in the quantum mechanical case provided the subsystems are not entangled in a sense 
that we define below. 



[53.6] Quantum entanglement 



Let us consider a system consisting of two sub-systems, "A" and "B", with no correlation between them. Then, the 
state of the system can be factorizcd: 

= (1535) 

But in reality the state of the two sub-systems can be correlated. In classical statistical mechanics p^ and p^ are 
probability functions, while p^^^ is the joint probability function. In the classical state we can always write 

y)^Y. P^^^'i^'^ y')S.,.'Sy,y' (1536) 

x\y' 

where x and y labels classical definite states of subsystems A and B respectively. This means schematically that we 
can write 

(1537) 

r 

where r — {x' ,y') is an index that distinguish pure classical states of A® B, and Pr = p"^'^^{x' , y') are probabilities 
such that ^Pr = 1, and p'^'^^^ i-^ Sx,x' is a pure classical state of subsystem A, and p'^'^^ t-^ 5y_yi is a pure classical 
state of subsystem B. Thus any classical state oi A® B can be expressed as a mixture of product states. 

By definition a quantum state is not entangled if it is a product state or a mixture of product states. Using explicit 
matrix representation it means that it is possible to write 

C^-HPrf^'t^]^ (1538) 

r 

It follows that an entangled state, unlike a non-entangled state cannot have a classical interpretation. The simplest 
implication of non-entanglement is the validity of the entropy inequality that was mentioned in the previous section. 
We can use this mathematical observation in order to argue that the zero entropy singlet state is an entangled state: 
It cannot be written as a product of pure states, neither it cannot be a mixture of product states. 

The case where p^'^^ is a zero entropy pure state deserves further attention. As in the special case of a singlet, we 
can argue that if the state cannot be written as a product, then it must be an entangled state. Moreover for the 
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Schmidt decomposition procedure it follows that the entropies of the subsystems satisfy S[p^] = S[p^]. This looks 
counter intuitive at first sight because subsystem A might be a tiny device which is coupled to a huge environment 
B. We emphasize that the assumption here is that the "universe" B is prepared in a zero order pure state. Some 
further thinking leads to the conjecture that the entropy of the subsystems in such circumstance is proportional to the 
area of surface that divides the two regions in space. Some researchers speculate that this observation is of relevance 
to the discussion of black holes entropy. 



In elementary textbooks the quantum measurement process is described as inducing "collapse" of the wavefunction. 
Assume that the system is prepared in state Pi„itiai = ^^'^ that one measures P = If the result of the 

measurement is P = 1 then it is said that the system has collapsed into the state /9fi„ai = l<y')(<y'l- The probability for 
this "collapse" is given by the projection formula PToh{(p\ip) = \{(p\il;)\'^ . 

If one regard p{x,x') or ip{x) as representing physical reality, rather than a probability matrix or a probability 
amplitude, then one immediately gets into puzzles. Recalling the EPR experiment this world imply that once the 
state of one spin is measured at Earth, then immediately the state of the other spin (at the Moon) would change from 
unpolarized to polarized. This would suggest that some spooky type of "interaction" over distance has occurred. 

In fact we shall sec that the quantum theory of measurement docs not involve any assumption of spooky "collapse" 
mechanism. Once we recall that the notion of quantum state has a statistical interpretation the mystery fades away. 
In fact we explain (see below) that there is "collapse" also in classical physicsl To avoid potential miss- understanding 
it should be clear that I do not claim that the classical "collapse" which is described below is an explanation of the 
the quantum collapse. The explanation of quantum collapse using a quantum measurement (probabilistic) point of 
view will be presented in a later section. The only claim of this section is that in probability theory a correlation is 
frequently mistaken to be a causal relation: "smokers are less likely to have Alzheimer" not because cigarettes help 
to their health, but simply because their life span is smaller. Similarly quantum collapse is frequently mistaken to be 
a spooky interaction between well separated systems. 

Consider the thought experiment which is known as the "Monty Hall Paradox" . There is a car behind one of three 
doors. The car is like a classical "particle" , and each door is like a "site" . The initial classical state is such that the car 
has equal probability to be behind any of the three doors. You are asked to make a guess. Let us say that you peak 
door #1. Now the organizer opens door #2 and you sec that there is no car behind it. This is like a measurement. 
Now the organizer allows you to change your mind. The naive reasoning is that now the car has equal probability to 
be behind either of the two remaining doors. So you may claim that it docs not matter. But it turns out that this 
simple answer is very very wrong! The car is no longer in a state of equal probabilities: Now the probability to find it 
behind door #3 has increased. A standard calculation reveals that the probability to find it behind door #3 is twice 
large compared with the probability to find it behind door #2. So we have here an example for a classical collapse. 

If the reader is not familiar with this well known " paradox" , the following may help to understand why we have this 
collapse (I thank my colleague Eitan Bachmat for providing this explanation). Imagine that there are billion doors. 
You peak door #1. The organizer opens all the other doors except door #234123. So now you know that the car is 
either behind door #1 or behind door #234123. You want the car. What are you going to do? It is quite obvious that 
the car is almost definitely behind door #234123. It is also clear the that the collapse of the car into site #234123 
does not imply any physical change in the position of the car. 



What do we mean by quantum measurement? In order to clarify this notion let us consider a system and a detector 
which are prepared independently as 



[53.7] Measurements, the notion of collapse 



[53.8] Quantum measurements, Schroedinger's cat 




(1539) 
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As a result of an interaction we assume that the detector correlates with the system as follows: 

?>_e„e„t* = ^Va|a)<8|g = a) (1540) 

We call such type of unitary evolution ideal projective measurement. If the system is in a definite a state, then it is not 
affected by the detector. Rather, we gain information on the state of the system. One can think of q as representing 
a memory device in which the information is stored. This memory device can be of course the brain of a human 
observer. Form the point of view of the observer, the result at the end of the measurement process is to have a 
definite a. This is interpreted as a "collapse" of the state of the system. Some people wrongly think that "collapse" 
is something that goes beyond unitary evolution. But in fact this term just makes over dramatization of the above 
unitary process. 

The concept of measurement in quantum mechanics involves psychological difficulties which are best illustrated by 
considering the "Schroedinger's cat" experiment. This thought experiment involves a radioactive nucleus, a cat, and 
a human being. The half life time of the nucleus is an hour. If the radioactive nucleus decays it triggers a poison 
which kills the cat. The radioactive nucleus and the cat are inside an isolated box. At some stage the human observer 
may open the box to see what happens with the cat... Let us translate the story into a mathematical language. A 
time t ^ the state of the universe (nucleus(8)cat(8)observer) is 

= I 1= radioactive) (E) \q = 1 = alive) (E) \Q = = ignorant) (1541) 

where q is the state of the cat, and Q is the state of the memory bit inside the human observer. If we wait a very 
long time the nucleus would definitely decay, and as a result we will have a definitely dead cat: 

[^waiting* = U= decayed) (g) |g = -1 = dead) ® IQ = = ignorant) (1542) 

If the observer opens the box he/she would see a dead cat: 

C^Bccingf/waiting* = |t=decaycd) (g) = -1 = dead) (g) IQ = -1 = shocked) (1543) 

But if we wait only one hour then 

1 



^ waiting r— 

v2 



I T) «) k = +1) + li) «) |g = -1) (g IQ = = ignorant) (1544) 



which means that from the point of view of the observer the system (nucleus+cat) is in a superposition. The cat at 
this stage is neither definitely alive nor definitely dead. But now the observer open the box and we have: 



U U ■ — — 

^ seeing^ waiting ^ \/2 



I t) ® |g = +1) ® IQ = +1) + U) ® |g = -1) (g IQ = -1) (1545) 



We see that now, form the point of view of the observer, the cat is in a definite(!) state. This is regarded by the 
observer as "collapse" of the superposition. We have of course two possibilities: one possibility is that the observer sees 
a definitely dead cat, while the other possibility is that the observer sees a definitely alive cat. The two possibilities 
"exist" in parallel, which leads to the "many worlds" interpretation. Equivalently one may say that only one of the 
two possible scenarios is realized from the point of view of the observer, which leads to the "relative state" concept 
of Everett. Whatever terminology we use, "collapse" or "many worlds" or "relative state", the bottom line is that we 
have here merely a unitary evolution. 



^^^^= [53.9] Measurements, formal treatment 

In this section we describe mathematically how an ideal projective measurement affects the state of the system. First 
of all let us write how the U of a measurement process looks like. The formal expression is 

^^inoasuicmont = ^ P^"' g) I^^"' (1546) 



254 



where P^") = is the projection operator on the state |a), and is a translation operator. Assuming that the 

measurement device is prepared in a state of ignorance \q = 0), the effect of D^"' is to get \q = a). Hence 



Y,^a'\a')®\q = 0) 



(1547) 



A more appropriate way to describe the state of the system is using the probability matrix. Let us describe the above 
measurement process using this language. After "reset" the state of the measurement apparatus is ct^^^ = \q=0){q=0\. 
The system is initially in an arbitrary state p. The measurement process correlates that state of the measurement 
apparatus with the state of the system as follows: 



(1548) 



a.b 



a.b 



Tracing out the measurement apparatus we get 



(1549) 



Where pa is the trace of the projected probabihty matrix p(°)pp(°), while p^") is its normalized version. We see that 
the effect of the measurement is to turn the superposition into a mixture of a states, unlike unitary evolution for 
which 



U nU^ 



(1550) 



So indeed a measurement process looks like a non-unitary process: it turns a pure superposition into a mixture. A 
simple example is in order. Let us assume that the system is a spin 1/2 particle. The spin is prepared in a pure 
polarization state p = IV') ("01 which is represented by the matrix 



Pab 



(1551) 



where 1 and 2 are (say) the "up" and "down" states. Using a Stern-Gerlach apparatus we can measure the polarization 
of the spin in the up/down direction. This means that the measurement apparatus projects the state of the spin using 



p(i) 



1 




and 



p(2 




1 



(1552) 



leading after the measurement to the state 



(2) 



|0iP 

|02| = 



(1553) 



Thus the measurement process has eliminated the off-diagonal terms in p and hence turned a pure state into a mixture. 
It is important to remember that this non-unitary non-coherent evolution arise because we look only on the state of 
the system. On a universal scale the evolution is in fact unitary. 
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[54] Theory of quantum computation 

== [54.1] Motivating Quantum Computation 



Present day secure communication is based on the RSA two key encryption method. The RSA method is based on 
the following observation: Let N be the product of two unknown big prime numbers p and q. Say that we want to 
find are what are its prime factors. The simple minded way would be to try to divide N by 2, by 3, by 5, by 7, and 
so on. This requires a huge number (~ N) of operations. It is assumed that A'' is so large such that in practice the 
simple minded approached is doomed. Nowadays we have the technology to build classical computers that can handle 
the factoring of numbers as large as TV ~ 2^*^" in a reasonable time. But there is no chance to factor larger numbers, 
say of the order N ~ 2'^"^. Such large numbers are used by Banks for the encryption of important transactions. In 
the following sections we shall see that factoring of a large number N would become possible once we have a quantum 
computer. 

Computational complexity: A given a number N can be stored in an n-bit register. The size of the register should 
be 71 ^ log(A^), rounded upwards such that A^ < 2". As explained above in order to factor a number which is stored 
in an n bit register by a classical computer we need an exponentially large number N) of operations. Obviously 
we can do some of the operations in parallel, but then we need an exponentially large hardware. Our mission is to 
find an efficient way to factor an n-bit number that do not require exponentially large resources. It turns out that a 
quantum computer can do the job with hardware/time resources that scale like power of n, rather than exponential 
in n. This is done by finding a number N2 that has a common divisor with A^. Then it is possible to use Euclid's 
algorithm in order to find this common divisor, which is either p or q. 

Euclid's algorithm: There is no efficient algorithm to factor a large number N ^ 2". The classical computational 
complexity of this task is exponentially large in n. But if we have two numbers A^i = N and N2 we can quite easily 
and efficiently find their greater common divisor GCD(A^i, A^2) using Euclid's algorithm. Without loss of generality we 
assume that A^i > A^2- The two numbers are said to be co-prime if GCD(A^i, A2) = 1. Euclid's algorithm is based on 
the observation that we can divide A^^i by A^2 and take the reminder A^3 = mod(A^i, A2) which is smaller than 7V2. Then 
we have GCD(A^i, A^2) = GCD(7V2, N3). We iterate this procedure generating a sequence Ni > N2 > N3 > > ■ ■ ■ 
until the reminder is zero. The last non-trivial number in this sequence is the greater common divisor. 

The RSA encryption method: The RSA method is based on the following mathematical observation. Given two 
prime numbers p and q define N = pq. Define also a and b such that ab = I mod [{p — — 1)]. Then we have the 
relations 

B A" mod [A^] (1554) 
A = B'' mod [A^] (1555) 

This mathematical observation can be exploited as follows. Define 

public key (TV, a) (1556) 
private key = {N,b) (1557) 

If anyone want to encrypt a message A, one can use for this purpose the public key. The coded message B cannot be 
decoded unless one knows the private key. This is based on the assumption that the prime factors p and q and hence 
b are not known. 



^=^= [54.2] The factoring algorithm 

According to Fermat's theorem, if A^ is prime, then = M mod (A') for any number M(< A'). This can be 
re-phrased as saying that the period of the function /(x) = mod (A^) is r = A^— 1. That means f{x + r) = f{x). 
If A^ is co-prime to M, but not necessarily prime, then the period of f{x) would be smaller than A^, because mod A^ 
is a finite group. This integer period r is called "the order" . 

The quantum computer is a black box that allows to find the period r of a function f{x). How this is done will be 
discussed in the next section. Below we explain how this ability allows us to factor a given large number A^. 
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(1) We have to store N inside an n-bit register. 

(2) We pick a large number M which is smaller than N. We assume that M is co-prime to N. This assumption can 
be easily checked using Euclid'd algorithm. If by chance the chosen M is not co-prime to N then we are lucky 
and we can factor N without quantum computer. So we assume that we are not lucky, and M is co-prime to N. 

(3) We build a processor that can calculate the function f{x) — AI^ mod (TV). This function has a period r which 
is smaller than TV. 

(4) Using a quantum computer we find one of the Fourier components of /(x) and hence its period r. This means 
that M'' = 1 mod (TV). 

(5) If r is not even we have to run the quantum computer a second time with a different M . There is a mathematical 
theorem that guarantees that with probability of order one we should be able to find M for which r is even. 

(6) Assuming that r is even we define Q = Af/^ mod {N). We have = 1 mod (TV), and therefore 
(Q — 1)(<5 + 1) = mod (TV). Consequently both {Q — 1) and {Q + 1) must have either p or g as common 
divisors with TV. 

(6) Using Euclid's algorithm we find the GCD of TV and TV = (Q — 1), hence getting either p or q. 

The bottom line is that given TV and TV/ an an input, we would like to find the period r of the functions 

f{x) = W mod (TV) (1558) 

Why do we need a quantum computer to find the period? Recall that the period is expected to be of order TV. 
Therefore the x register should be bits, where ric is larger or equal to n. Then we have to make order of 2""= 
operations for the purpose of evaluating f{x) so as to find out its period. It is assumed that n is large enough such 
that this procedure is not practical. We can of course try to do parallel computation of f{x). But for that we need 
hardware which is larger by factor of 2". It is assumed that to have such computational facility is equally not practical. 
We say that factoring a large number has an exponentially complexity. 

The idea of quantum processing is that the calculation of f{x) can be done "in parallel" without having to duplicate 
the hardware. The miracle happens due to the superposition principle. A single quantum register of size Uc can be 
prepared aX t = with all the possible input x values in superposition. The calculation of f{x) is done in parallel 
on the prepared state. The period of f{x) in found via a Fourier analysis. In order to get good resolution Uc should 
be larger than n so as to have 2""= ^ 2". Neither the memory, nor the number of operation, are required to be 
exponentially large. 

== [54.3] The quantum computation architecture 

We shall regard the computer as a machine that has registers for memory and gates for performing operations. The 
complexity of the computation is determined by the total number of memory bits which has to be used times the 
number of operations. In other words the complexity of the calculation is the product memory x time. As discussed 
in the previous section classical parallel computation do not reduce the complexity of a calculation. Both classical 
and quantum computers work according to the following scheme: 

loutput) = f7[input] |0) (1559) 

This means that initially we set all the bits or all the qubits in a zero state (a reset operation), and then we operate 
on the registers using gates, taking the input into account. It is assumed that there is a well define set of elementary 
gates. An elementary gate (such as "AND") operates on few bits (2 bits in the case of AND) at a time. The size of 
the hardware is determined by how many elementary gates are required in order to realize the whole calculation. 

The quantum analog of the digital bits (" 0" or " 1" ) are the qubits, which can be regarded as spin 1/2 elements. These 
are "analog" entities because the "spin" can point in any direction (not just "up" or "down"). The set of states such 
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that each spin is aHgned either "up" or "down" is called the computational basis. Any other state of a register can be 
written a superposition: 



I*) 



(1560) 



The architecture of the quantum computer which is requited in order to find the period r of the function f{x) is 
illustrated in the figure below. We have two registers: 



X = (a;o,a:;i,X2, ...,a;„^_i) 

y = {yo,yo,y2,-,yn-i) 



(1561) 
(1562) 



The registers x and y can hold binary numbers in the range x < Nc and y < N respectively, where A^c = 2""= and 
N = 2" ^ N. The y register is used by the CPU for processing mod(A^) operations and therefore it requires n bits. 
The X register has Uc bits and it is used to store the inputs for the function f{x). In order to find the period of f{x) 
the size Uc of the latter register should be some number (say 100) times n. Such large Uc is required if we want to 
determine the period with large accuracy. 
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We are now ready to describe the quantum computation. In later sections we shall give more details, and in par- 
ticular we shall see that the realization of the various unitary operations which are listed below does not require an 
exponentially large hardware. The preliminary stage is to make a "reset" of the registers, so as to have 

I*) = = |0, 0,0,. ..,0,0; 1,0,0, ...,0,0) (1563) 

Note that it is convenient to start with y = 1 rather than y = Q. Then come a sequence of unitary operations 

U = UfUmUh (1564) 

where 



Uh 
Um 



U 



(1565) 
(1566) 



Uf 



(1567) 



The first stage is a unitary operation Uh that sets the x register in a democratic state. It can realized by operating 
on with the Hadamard gate. Disregarding normalization we get 



1^) = Y.\^)®\y^^) 

X 



(1568) 
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The second stage is an x controlled operation Um- This stage is formally like a quantum measurement: The x register 
is "measured" by the y register. The result is 

X 

Now the y register is entangled with the x register. The fourth stage is to perform Fourier transform on the x register: 



E 



(1570) 



We replace the dummy integer index a;' by fc = {2t: /Nc)x' and re- write the result as 



I*) = Ei^) 



Y,e^'^\f{x)) 



Ak)\ 



k 

The final stage is to measure the x register. The probability to get k as the result is 

Pk = E«'''i/(^)) 



(1571) 



(1572) 



The only non-zero probabilities are associated with k = integer x {2n/r). Thus we are likely to find one of these k 
values, from which we can deduce the period r. Ideally the error is associated with the finite length of the x register. 
By making the x register larger the visibility of the Fourier components becomes better. 



[54.4] Elementary quantum gates 



The simplest gates are one quibit gates. They can be regarded as spin rotations. Let us list some of them: 



1 

c''"/^ 



(1573) 



5 = = 
Z = = 



1 

i 

1 

-1 



X 
Y 
H 
R 



1 

1 

-i 

1 



-inSy 



NOT gate 



—iz f I "'^j^ j = —-={a,r, + <Ty) = ie = Hadamard gate 



^( \ 1 



x/2 

^(1 - iGy) = e-'^'^/^)^" = 90deg Rotation 
v2 



We have i?"* = — 1 which is a 27r rotation in SU{2). We have = Y^ = Z'^ = H'^ = 1 which imphes that these are tt 
rotations in U{2). We emphasize that though the operation of H on "up" or "down" states looks like 7r/2 rotation, 
it is in fact a tt rotation around a 45° inclined axis: 



V2' ' V2 



(1574) 
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More interesting are elementary gates that operate on two qubits. In classical computers it is popular to use "AND" 
and "OR" gates. In fact with "AND" and the one-bit operation "NOT" one can build any other gate. The "AND" 
cannot be literally realized as quantum gate because it does not correspond to unitary operation. But we can build 
the same logical circuits with "CNOT" (controlled NOT) and the other one-qubit gates. The definition of CNOT is 
as follows: 



1 
1 







1 

1 



(1575) 



The control bit is not affected while the y bit undergoes NOT provided the a; bit is turned on. The gate is schematically 
illustrated in the following figure: 



y + X 



I is amusing to see how SWAP can be realized by combining 3 CNOT gates: 



/I 



Us 



1 

1 



(1576) 



which is illustrated is the following diagram: 



X 2x+y 2x+y 




y x+y x+y 3x+2y 



The generalization of CNOT to the case where we have two qubit control register is known as Toffoli. The NOT 
operation is performed only if both control bits are turned on: 



H -^^T' T -^^T' T - H 



S 



The realization of the Toffoli gate opens the way to the quantum realization of an AND operation. Namely, by setting 
y = at the input, the output would he y = xi /\ X2- For generalization of the Toffoli gate to the case where the x 
register is of arbitrary size see p. 184 of Nielsen and Chuang. 
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== [54.5] The Hadamard Transform 

In the following we discuss the Hadamard and the Fourier transforms. These are unitary operations that are defined 
on the multi-qubit x register. A given basis state \xq,xi^x^, ...) can be regarded as the binary representation of an 
integer number: 



(1577) 



We distinguish between the algebraic multiplication for which we use the notation xx' ^ and the scalar product for 
which we use the notation x ■ x' , 



r 

XX — ^ Xf^Xg^ 



(1578) 



So far we have defined the single qubit Hadamard gate. If we have an multi-qubit register it is natural to define 



H ® H ® H ® ■ ■ ■ 



(1579) 



The operation of a single-qubit Hadamard gate can be written as 



— (|o) + (-i)^Mi)) 



(1580) 



If we have a multi-qubit register we simply have to perform (in parallel) an elementary Hadamard transform on each 
qubit: 



1-^0 ; -^1 ; ■ ■ ■ 5 : ■ ■ ■) 



^ n;^(io)+(-ir'ii))-;7l^7n 



fcr=0,l 



1 ^ (-l)'=°-°+'^-^-^+-|fco,A:i,...,fc,,...) = -l=E(-lfnA) 

K-Q.ki,... k 



(1581) 



The Hadmard transform is useful in order to prepare a "democratic" superposition state as follows: 



|0,0, ...,0) ^ -^(|0) + |1)) ® -^(|0) + |1)) ® ... ® -^(|0) + |1)) ^ 



/1\ 

1 



Vi/ 



(1582) 



To operate with a unitary operator on this state is like making parallel computation on all the possible x basis states. 
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[54.6] The quantum Fourier transform 



The definitions of the Hadamard transform and the quantum Fourier transform are very similar in style: 

t/H._a|3.) = -^^(-l)^--|fc) (1583) 
C/p_i„|x) = -±^ye-'^''^\k) (1584) 



Let us write the definition of the quantum Fourier transform using diiferent style so as to see that it is indeed a 
Fourier transform operation in the conventional sense. First we notice that its matrix representation is 



1 



(x'|C/p_,|x) = ^=e 



Nr. 



(1585) 



If we operate with it on the state \tp) = i^xlx) we get \(p) = J2x where the column vector ip^ is obtained 

from ipx by a multiplication with the matrix that represents J/Fourid- Changing the name of the dummy index form x 
to k we get the relation 



(1586) 



^ x=0 



This is indeed the conventional definition of 



FT 



'■P2 



(1587) 



The number of memory bits which are required to store these vectors in a classical register is of order iV ~ 2". The 
number of operations which is involved in the calculation of a Fourier transform seems to be of order N'^. In fact 
there is an efficient "Fast Fourier Transform" (FFT) algorithm that reduces the number of required operations to 
NlogN ~ n2". But this is still an exponentially large number in n. In contrast to that a quantum computer can 
store these vectors in n qubit register. Furthermore, the "quantum" FT algorithm can perform the calculation with 
only rt^ log n log log 71 operations. We shall not review here how the Quantum Fourier transform is realized. This 
can be found in the textbooks. As far as this presentation is concerned the Fourier transform can be regarded as a 
complicated variation of the Hadamard transform. 



^^^^= [54.7] Note on analog or optical computation 

A few words are in order here regarding quantum computation versus classical analog computation. In an analog 
computer every analog "bit" can have a voltage within some range, so ideally each analog bit can store infinite amount 
of information. This is of course not the case in practice because the noise in the circuit defines some effective finite 
resolution. Consequently the performance is not better compared with a digital computers. In this context the analog 
resolution is a determining factor in the definition of the memory size. Closely related is optical computation. This 
can be regarded as a special type of analog computation. Optical Fourier Transform of a "mask" can be obtained 
on a "screen" that is placed in the focal plane of a lens. The FT is done in one shot. However, also here we have 
the same issue: Each pixel of the mask and each pixel of the screen is a hardware element. Therefore we still need 
an exponentially large hardware just to store the vectors. At best the complexity of FT with optical computer is of 
order 2". 
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[54.8] The Um operation 



The CNOT/TofFoli architecture can be generalized so as to reahze any operation of the type y = f{xi,X2, ■■■), as an 
x-controllcd operation, where j/ is a single qubit. More generally we have 



X = (a:o,a;i,a;2...x„^_i) 

y = {yo,yi,y2,--yn-i) 



(1588) 
(1589) 



and we would like to realize a unitary controlled operation 



U = ^ |x) {x\ «) = P("' ® + P(i' eg) C/(i) + P(2) ^ ui-2) 



(1590) 



This is formally like a measurement of the x register by the y register. Note that x is a constant of motion, and that 
U has a block diagonal form: 



/C/(o) 



{x ,y'\U\x,y) = S^'^^Uy, 



(x) 



•7 



(1591) 



Of particular interest is the realization of a unitray operation that maps y = 1 to y = f{x). Let us look on 



^ M y 



f{x)y mod(iV) 



(1592) 



If M is co-prime to N then U is merely a permutation matrix, and therefore it is unitary. The way to realize this 
operation is implied by the formula 



Tic — i- 



(1593) 



s=0 



which requires ric stages of processing. The circuit is illustrated in the figure below. In the s stage we have to perform 
a controlled multiplication of y by Ms = A/^ mod {N). 
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M, 



IVPy 
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[55] The foundation of statistical mechanics 
== [55.1] The canonical formalism 



Consider some system, for example particles that are confined in a box. The Hamiltonian is 

n = n{r,p-X) (1594) 

where X is some control parameter, for example the length of the box. The energy of the system is defined as 

E = (H) = traceiHp) = ^PrEr (1595) 

r 

where is the last equality we have assume that we are dealing with a stationary state. Similarly the expression for 
the generalized force y is 

It is argued that the weak interaction with an environment that has a huge density of state q„„ leads after relaxation 
to a canonical state which is determined by the parameter /3 ~ dlog{g^„^{E)) /dE that characterizes the environment. 
The argument is based on the assumption that the universe (systcm+environment) is a closed system with some total 
energy i^totai- After ergodization the system get into a stationary- like state. The probability Pr to find the system in 
state Er is proportional to f?onv(^'totai— ^'r) ~ f?onv(-E'totai)e~'^^''. Accordingly 

Pr = le-'^^- (1597) 
where the so-called partition function provides the normalization 

Z(/3;X) = ^e-''^- (1598) 

r 

One observes that the energy can be obtained from 
dlnZ 

E = (1599) 
while the generalized force is 

If we slightly change X and (3 and assume that the state of the system remains canonical then 

dE = ^dprEr + Y^PrdEr = TdS-ydX (1601) 

r 

where the absolute temperature is defined as the integration factor of the first term 

T — integration factor — — (1602) 

P 
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and the implied definition of the thermodynamic entropy is 

S = -^prlnpr (1603) 

Note that the thermodynamic entropy is an extensive quantity in the thermodynamic limit. At this state it is 
convenient to define the Helmholtz generating function 

FiT,X) = -ilnZ(/3;X) (1604) 

which allows to write the state equation in an elegant way: 
dF 

S - -gf (1605) 
dF 

y = (1606) 

and 

E = F + TS (1607) 



[55.2] Work 



In the definition of work the system and the environment are regarded as one driven closed unit. On the basis of the 
"rate of change formula" we have the following exact expression: 

W = -j{J')tdX (1608) 

where = —dH/dX. Note that {!F)t is calculated for the time dependent (evolving) state of the system. From linear 
response theory of driven closed systems we know that in leading order 

(^)t ~ {r)x ~vX (1609) 

The first terms is the conservative force, which is a function of X alone. The subscript implies that the expectation 
value is taken with respect to the instantaneous adiabatic state. The second term is the leading correction to the 
adiabatic approximation. It is the "friction" force which is proportional to the rate of the driving. The net conservative 
work is zero for a closed cycle while the "friction" leads to irreversible dissipation of energy with a rate •qX'^. 

The above reasoning implies that for a quasi static process we can calculate the work as the sum of two contributions: 
W = — + Wirrovoraibio- Thc couservativc work is defined as 

W ^ ( y{X)dX (1610) 



The rate of irreversible work is 

= VX"" (1611) 
where 77 is the "friction" coefficient, which can be calculated using linear response theory. 
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[55.3] Heat 



In order to understand which type of statements can be extracted form the canonical formahsm we have to discuss 
carefully the physics of work and heat. We distinguish between the system and the environment and write the 
Hamiltonian in the form 

= ?^(r-,p;X(t)) +H.„. +K„v (1612) 

It is implicit that the interaction term is extremely small so it can be ignored in the calculation of the total energy. 
We define 

W = work= - (1613) 

Q = heat = -((K„v)b - (K„v)a) (1614) 
i^wi - £^.„H..i = {n)B-{n)A = Q + W (1615) 

If for a general process we know the work W and the change in the energy of the system, then wc can deduce 
what was the heat flow Q. If we compare dE = TdS — ydX with the expression dE = dQ + dW we deduce that 
TdS =dQ +(fWi„ovoraibic- Tlils implics that the change in the entropy of the system is: 

5.,„.i - 5.„uiai = / ^f'JQ+'fW.^^,,^) (1616) 



A T 



Optionally we can write 



/: 



B 

TdS = Q + W,„evc,.™e (1617) 



This is the change in the energy dprEr which is resulted from transitions between levels. These transitions are 
induced by the coupling to the environment and/or by the non-adiabaticity of the driving. 

== [55.4] The second law of thermodynamics 



There are two possible approaches to discuss irreversible processes and the notion of entropy. One is based on the 
theory of ergodic motion, while the other on the above thermodynamic approach. We summarize both in this section, 
and exploit the opportunity to distinguish between Shanon entropy, Von-Newman entropy, Boltzmann entropy and 
Thermodynamic entropy. All are defined by the same look alike formula S* = — "^^Pr log(pr) but the context and the 
meaning of the pr is in general not the same. 

If we have a probability distribution {pr} then S provides a measure for the uncertainty which is involved in our 
knowledge of the statistical state. This point of view that regards S as an information measure has been promoted 
by Shanon. In the quantum mechanical context we define "classical state" as implying 100% certainty for any 
measurement. Such states do not exist in Nature. Rather the states of minimum uncertainty in N dimensional Hilbert 
space are the pure states, and they have finite information entropy. See http://arxiv.org/abs/quant-ph/0401021. 
They should be contrasted with the worst mixed state whose entropy is = log(7V). Irrespective of that, Von-Newman 
has used a look-alike formula in order to characterize the purity of the state. In the Von-Newman definition of purity 
the Pr are the weights of pure states in the mixture. It is important to realize that the Von-Newman entropy has 
nothing to do with the theory of irreversibility. If we consider for example the free expansion of a wavepackct in a big 
chaotic box, then we have 5 = at any moment. Still it is nice that the Von-Newman entropy of a canonical state 
coincides with the thermodynamic definition. 

It was Boltzmann who has defined S" is a way that allows to discuss irreversibility. The idea was to divide the phase 
space of a system into small cells, and to characterize the statistical state in this representation using S. Then 
Boltzmann has proposed the "H theorem" stating that S has the tendency to increase with time if the motion is 
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chaotic. The same reasoning can be carried out in the quantum mechanical context where the division of Hilbcrt 
space into "cells" is defined by a complete set of projectors. Obviously in the latter context recurrences imply that 
the dynamics of a quantized closed chaotic system looks irreversible only for a limited time. 

Coming back to the thermodynamic definition of entropy we would like to see how it is related to the theory of 
irreversibility. For this purpose let us discuss again its definition within the framework of the canonical formalism. 
The state of a system in equilibrium is represented by a "point" in the canonical space {f3,X), where X is a set of 
control parameters. If we have a closed system we can take it from state A to state B adiabatically in a reversible 
manner. This way we can define set of adiabatic surfaces: 



dfi- 



dPr p 



dX ^ 



(1618) 



We can label each adiabatic surface by a number that we would like to call entropy. The convention would be that 
S[A] < S[B] if we can get from A to B via an irreversible process. Then it is possible to re-write the left hand side 
of the above equation as T{(3, X)dS , where the implied definition of the function T{p,X) depends on our somewhat 
arbitrary definition of the function S{P, X). 

At this stage the definition of the entropy function is not unique, and the function T(/?, X) might depended on X. By 
definition the adiabatic surfaces are described by the equation S{(3, X) = const. For a reversible quasi static process 
we can make the identification (JQ = X)dS. During a reversible quasi-static process the change of the entropy of 
the system is / dQ/T^y^{f}, X,y^) while that of the bath is — / (?(5/Tb^th(/3, ^bath)- We would like to define the entropy 
of the combined system-|-bath as the sum of the respective entropies, and to have zero entropy change if the process 
is reversible. This wishful thinking can be realized if the so-called integration factor T(/3, X) is a function of (3 alone. 
Indeed the canonical formalism implies that such integration factor exists, leading to the canonical definitions of 
absolute temperature and thermodynamic entropy. 



== [55.5] Fluctuations 

The partition function and hence the thermodynamic equations of state give information only on the spectrum {En} 
of the system. In the classical context this observation implies that a magnetic field has no influence on the equilibrium 
state of a system because the spectrum remains E = mu^/2 with < < oo. In order to probe the dynamics we have 
to look on the fluctuations S{t) = {J^{t)!F{0)) , where is some observable. The Fourier transform of S{t) describes 
the power spectrum of the fluctuations: 

/oo 
Sity'^^dT = 5^p„;^ |.F,„„p 2ttS{uj - {E,n-En)) (1619) 
-°° 11 m 

This is the same object that appears in the Fermi-Golden-rulc for the rate of transitions due to a perturbation 
term V = —f{t)!F. In the above formula lj > Q corresponds to absorption of energy (upward transitions), while 
w < corresponds to emission (downward transitions). It is a straightforward algebra to show that for a canonical 
preparations with pn cx cxp{~En/T) there is a detailed balance relation: 

S{cj) - Si-io) exp i—j (1620) 

This implies that if we couple to the system another test system (e.g. a two level "thermometer") it would be driven 
by the fluctuations into a canonical state with the same temperature. 

The connection with Fermi- Golden-rule is better formalized within the framework of the so called fluctuation- 
dissipation relation. Assume that the system is driven by varying a parameter X, and define as the associated 
generalized force. The Kubo formula (sec Dynamics and driven systems lectures) relates the response kernel to S{t). 
In particular the dissipation coefficient is: 



(1621) 
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If the system is in a canonical state it follow that the zero frequency response is rjo = S{0)/{2T). If we further assume 
"Ohmic response" , which means having constant ri(uj) = 770 up to some cutoff frequency, then the above relation can 
be inverted: 

= ^0—^ (1622) 



The best known application of this relation is known as the Nyquist theorem. If a ring is driven by an electro-motive 
force — $, then the rate of heating is VV = G<i>^, which is know as Joule law. The generalized force which is associated 
with $ is the current T, and G is known as the conductance. Note that Joule law is equivalent to Ohm law (X) = —G^. 
It follows from the fluctuation-dissipation relation that the fluctuations of the current at equilibrium for uj -^T are 
described by S{uj) — 2GT. It should be clear that in non-equilibrium we might have extra fluctuations, which in this 
example are known as shot noise. 



[55.6] The modeling of the environment 



It is common to model the environment as a huge collection of harmonic oscillators, and to say that the system if 
subject to the fluctuations of a field variable J- which is a linear combination of the bath coordinates: 



1/2 



(fla -I- al) 



(1623) 



For preparation of the bath in state n = {ua} we get 



(1624) 



a ± 



Using 



{na + l\Qa\na) 



1 



2maUJa 



1/2 



{na-l\Qa\na) 



1 



we get 



2maUJa 



-2nci 



1/2 



(1625) 
(1626) 



(1+71q)(5((jJ — bJa) + na5{ijJ + Wa) 



2mQa'Q 

For a canonical preparation of the bath we have [ua) — /(wq) = l/(e'^/-'" ^1)- H follows that 
~S{u) = 2J(|c.|)x|(1+/(^^)) = 2J{lo) ^ 



(1627) 



1 - e-P"^ 



(1628) 



where we used /(— w) — —(1 + f{ujj), and defined the spectral density of the bath as 



(1629) 



with anti-symmetric continuation. For an Ohmic bath J{lo) — rjuj, with some cutoff frequency lOc- 



